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This  dissertation  is  an   attempt  to  m.eet  the  following 
objectives:   {1}  to  identify  potential  econorr^.ic  and  non- 
economic  factors  which  might  differentiate  between  finan- 
cially distressed  firms  and  those  which  appear  to  be 
economically  viaole;  (2)  to  statistically  determine  a  set  of 
variables  which  has  the  highest  discriminatory  power;  and 
(3)  to  suggest  both  public  and  managerial  action  v/hich  might 
be  taken  in  light  of  the  statistical  results  obtained  in 
point  2 . 

To  begin  the  analysis,  it  v/as  hypothesized  that  the 
probability  of  financial  distress  of  a  firm  at  time  t  is  a 
function  of  the  net  cash-flow  per  dollar  of  assets  at  time 
t,  the  extent  of  the  risk  exposure,  and  the  mismanagement  of 
the  firmi. 


Three  potential  apprcaches  for  predicting  failure 
are  discussed  in  the  dissertation:  "deterT.inistic ,  "  prob- 
a.bili3tic,  and  blend  ar.proach.   The  "deterministic"  approach 
was  meant  to  define  the  traditional  method  which  uses  the 
financial  radios  m  conjunction  with  the  MDA  technique. 
The  probabilistic    approach  was  stated  to  be  the  one  which 
does  not  treat  the  financial  ratios  and  other  related  forces 
as  factors  v/hose  values  can  be  determined  with  certainty  at 
any  point  of  time  in  the  near  future.   Rather,  it  attaches 
some  probability  distribution  to  each  factor,  based  upon  its 
historical  performance.   And,  instead  of  using  the  MDA 
technique,  it  relies  upon  the  modern  probability  theories. 
The  probabilistic  approach  suggested  in  this  study, 
although  intriguing,  requires  a  great  deal  of  research  be- 
fore it  can  be  put  to  work.   The  blend  approach,  used  in  the 
present  study,  was  therefore  proposed  to  act  as  a  bridge 
between  the  traditional  and  the  probabilistic  approaches. 
It  retains  most  of  the  favorable  properties  of  the  traditionc'il 
approach,  specially  with  respect  to  the  MDA  technique,  yet 
incorporates  some  facets  of  the  probabilistic  approach. 

After  presenting  an  analytic  fram.ework  for  utilization 
of  the  blend  approach  and  providing  a  groundwork  for  the  ap- 
plication of  the  MDA  technique  in  two  successive  chapters, 
an  effort  was  made  to  empirically  test  the  hypothesis.   Due 
to  the  confidentiality  of  a  part  of  financial  data,  however, 
the  hypothesis  had  to  be  modified  by  substituting  the  cash- 
flow element  with  profitability. 


xvi 


sixty  SIPC  failed  and  60  nonf ailed  firms  were  selected 
and  the  original  and  holdout  samples  were  formed  by  halving 
the  entire  observations  and  assigning  each  half  randomly  to 
one  sample.   The  main  thrust  of  the  empirical  part  rests 
upon  Test  I  whose  predictive  span  is  about  one  year.   An 
attempt  was  also  m.ade  to  extend  the  analysis  to  two  years 
prior  to  bankruptcy,  in  Test  II. 

Test  I  examines  83  variables  and  selects  12  of  them 
as  an  "optimal"  subset  by  a  multi-stage  stepwise  selection 
procedure.   The   discriminant  model  developed  in  this  test 
successfully  differentiates  between  failed  and  nonfailed 
groups.   The  quadratic  classification  procedure  correctly 
classifies  S8.33  percent  of  the  original  sample  and  mis- 
classifies,  on  average,  13.3  3  percent  of  the  holdout  sample. 
These  results,  along  with  the  results  of  applying  the 
scrambled  method,  Lachenbruch  classification  procedure,  and 
"normalized"  model,  reveal  the  relatively  high  accuracy  and 
validity  of  the  model. 

The  12  selected  variables  measure  profitability,  risk, 
and  rndsmanagement  of  a  firm.   The  relative  discriminatory 
powers  of  these  variables  in  the  model  reveal  that  the  risk 
elem.ent  is  the  most  significant  determinant  of  rhe  firm's 
survival.   Profitability  and  mismanagem.ent  are  in  the  second 
and  t>iird  places.   This  may  help  m.anagements  in  making 
decisions  v/hen  considering  risky  ventures. 


Contrary  to  Test  I,  the  result  of  Test  II  was 
expectedly  less  than  successful.   This  was  partly  due  to  the 
nature  of  the  securities  business  and  partly  due  to  the 
type  of  data  available. 
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CHAPTER  I 
INTRODUCTION 

The  Backbone  of  the  Capital  Market 

During  1974,  the  market  value  of  all  security 
transactions  on  the  United  States  registered  exchanges 
amounted  to  $124.7  billion.   The  significance  of  the  figure 
may  become  more  apparent  if  it  is  considered  in  conjunction 
with  the  nation's  Gross  National  Product  in  that  year.   The 
GNP  in  1974  amounted  to  $1,397.4  billion  and  the  value  of 
the  securities  transactions  was  as  large  as  an  amount  equal 
to  about  9  percent  of  the  GNP.   Although  this  is  the  lowest 
percentage  in  the  past  few  years  {the  highest  is  23,  in  1968), 
it  strikingly  reveals  the  important  role  that  the  securities 
broker/dealers  play  in  flowing  funds  in  the  nation's  capital 
market. 

The  huge  volume  of  securities  transactions  in  each 
year  is  handled  by  several  thousands  of  broker/dealer  firms. 
In  1973,  for  example,  the  number  of  registered  broker/dealer 
firms  was  4,088.   These  firm.s  had  employed  a  total  of  about 
349,000  employees  to  channel  the  savings  of  individuals  to 
corporate  users  and  to  transfer  the  ownership  of  outstanding 
securities  from  one  investor  to  another.   The  total  assets 
marshalled  by  the  securities  firms  having  gross  securities 


income  of  $20,000  and  over  amounted  to  $2  5,9  billion. 

In  order  to  present  a  brief  profile  of  the  securities 
broker/dealer  firms,  similar  statistics  as  those  stated  above 
are  tabulated  in  Table  1  for  the  period  1969-1974.   This  may 
aid  in  understanding  the  general  position  of  the  securities 
industry  in  the  overall  economy. 

Literature  Review 

The  maintenance  of  fair  and  orderly  markets  for 
securities  has  long  been  one  of  the  highly  desired  objectives 
of  the  legislatures,  administrators,  courts,  and  various  self- 
regulatory  organizations.   The  Securities  Act  of  1933  and 
1934  along  with  the  creation  of  the  Securities  and  Exchange 
Commission  (SEC)  to  administer  the  acts,  as  well  as  the 
pursuant  amendment  acts  of  1939,  1940,  1964,  1970,  1974,  and 
1975  partially  reflect  the  main  concern  of  these  authorities 
in  providing  conditions  designed  to  create  securities  mar- 
kets capable  of  meeting  the  nation's  need  for  investment 
capital  while  operating  in  the  public  interest  and  serving 
to  protect  the  public  investor. 

For  a  long  time,  the  going  presumption  was  that  the 
efforts  directed  toward  preventing  fraudulent,  deceptive 
and  manipulative  acts  in  the  securities,  markets  would  pro- 
tect the  individual  investors  and  the  public  interest.   It 
was  not  until  1970  that  the  hardship  and  sometimes  disaster 
which  were  faced  by  the  individual  investors  whose  broker/ 
dealers  went  bankrupt  or  underwent  liquidation  were  widely 


TABLE  1. 


SOME  SELECTED  STATISTICS  WHICH  PROVIDE  A  BRIEF  PROFILE  OF  THE  SECURITIES  BROKER/DEALER  FIRMS  FOR  THE 
YEARS  1969-74.   (WHERE  NOT  SPECIFIED,  DOLLARS  ARE  IN  MILLIONS.) 


(2) 

Volume  of 

Securities 

Transactions,  5 

(3) 

GNP 
(in  Billions, 
Current  $) 

No. 

of  Firms 

(6) 
No.  of 
Employees 
(in  Thousands) 

(7)* 
Total 
Assets 

(8)* 

Operating 

Income 

(Before 

Taxes) 

(9)* 

Return  on 
Assets  (Be- 
fore Taxes) 

(8)M7) 

(1) 

Year 

(4) 
Total 

(5) 

Firms  with 

$20,000  or 

More  Income 

1969 

179,798 

725.6 

4,681 

2,619 

366 

24,213 

561 

.023 

1970 

135,889 

722.5 

4,614 

2,332 

338 

24,442 

271 

.011 

1971 

193,821 

745.4 

4,553 

2,539 

353 

28,705 

1,279 

.044 

1972 

213,548 

790.7 

4,255 

2,512 

386 

33,728 

976 

.029 

1973 

186,173 

839.2 

4,088 

2,164 

349 

25,948 

57 

.002 

1974 

124,709 

821.2 

3,982 

NA 

NA 

NA 

NA 

NA 

*The  figures  in  this  column  are  related  to  the  firms  having  gross  securities  income  of  $20,000  and  over,  as 
reported  in  column  (5). 

IJA  indicat'is  data  not  available. 


Note:   Figures  in  column  2  are  for  the  U.S.  registered  exchanges  only. 

Source;   For  columns  (2)  and  (3),  1974  Statistical  Abstracts  of  the  United  States. 

For  columns  (4),  (5),  {6)~,     (7)  ,  and  (8),  1974  Securities  and  Exchange  Commission  Annual  Reports. 


realized.   The  substantial  operational  and  financial 
problems  that  the  securities  industry  experienced  during 
the  period  from  1967  through  1970  had  significantly  dimin- 
ished the  investors'  confidence  in  the  markets;  and  the  need 
for  urgent  corrective  action  was  felt  everywhere. 

In  December  1970,  Congress  passed  the  Securities 
Investor  Protection  Act  to  restore  public  confidence  in  the 
safety  of  the  securities  markets.   This  legislation  estab- 
lished the  Securities  Investor  Protection  Corporation  (SIPC) . 
And,  the  corporation  provided  a  limited  insurance  for  the 
customer  accounts  of  the  securities  industry. 

During  its  four  years  of  existence,  SIPC  has  placed 
a  total  of  109  failed  firms  in  liquidation.   It  has  received 
a  total  of  97,230  customer  claims.   And,  as  of  December  31, 
1974,  it  has  distributed  securities  and  cash  with  an  aggregate 
value  of  $241  million  to  satisfy  these  claims.   Of  this 
amount,  approximately  $202  million  was  derived  from  debtor 
estates.   The  remaining  $39  million,  however,  was  provided 
by  SIPC  through  its  advances  to  the  trustees.   SIPC's 
assessments  on  member  firms  during  the  four-year  period 
totaled  $106  million. 

To  minimize  the  exposure  of  the  SIPC  fund,  which  is 
backed  by  a  billion  dollars  of  taxpayers'  monies.  Congress 
directed  the  SEC,  at  the  time  of  passing  the  Securities 
Investor  Protection  Act,  to  compile  a  list  of  unsafe  and 
unsound  practices  by  broker/dealers  and  to  report  on  the 


corrective  steps  being  taken.   Before  the  SEC  report  is 
reviev.'ed ,  the  results  of  another  study  which  was  published 
prior  to  the  completion  of  this  report  is  presented. 

Reviev;  of  SEC  Records  of  the  Demise  of 
Selected  Broker/Dealers 

This  is  the  title  of  a  staff  study  undertaken  for 
the  Special  Subcommittee  on  Investigations  of  the  Committee 
on  Interstate  and  Foreign  Commerce  of  the  House  of  Repre- 
sentatives.   The  need  for  this  study  was  stimulated  by  the 
crisis  of  the  securities  industry  over  the  period  1967-70 
which  had  brought  disaster  to  numerous  broker/dealers  and 
financial  hardship  for  the  public  investors.   The  study  is 
a  review  of  the  files  of  the  commission's  headquarters  offi- 
ces for  46  selected  broker/dealers  who  went  into  some  form 
of  liquidation  during  the  3-year  period  ending  December  31, 
1970.   It  also  incorporates  additional  information  obtained 
from  other  sources;  and  includes  case  studies  for  all  46 
firms . 

The  review  concluded,  among  other  things,  that  there 
were  many  reasons  for  the  failures  of  the  subject  firms  and 
the  factors  were  both  internal  and  external  to  the  firms. 
Internal  factors  included  capital  deficiency,  operational 
losses,  mismanagement  and  inefficiency  in  operating  the 
firms'  business,  over-expansion  of  operations,  inoperable 
and  overly  expensive  modernization  programs,  back-office 
difficulties,  irresponsible  actions  by  principals,  and 


and  fraud.   External  factors  included  a  reduced  volume  of 
trading  and  a  large  decline  in  prices  of  many  stocks  during 
the  period. 

Study  of  Unsafe  and  Unsound  Practices 
of  Brokers  and  Dealers^ 

This  study  was  undertaken  by  the  regular  staff  of 
the  SEC  pursuant  to  Section  11(h)  of  the  SIPC  Act.   Like  the 
previous  study,  this  one  is  also  on  the  financial  and  opera- 
tional crisis  of  1967-70,  but  on  a  larger  scale  and  in  a  more 
comprehensive  way.   It  examines  industry  conditions  and 
practices  which  permitted  the  crisis  to  develop.   To  name 
some,  but  not  all,  the  report  cites  the  following  unsound 
practices : 

o   Inadequacy  and  inperm.anence  of  capital. 

°      Over-emphasis  on  sales  and  trading  activities 

at  the  expense  of  operational  resources. 
o   Absence  of  control  of  securities  traffic. 
«   Absence  of  an  effective  early  warning  system 
in  self-regulatory  organizations  and  their 
inability  to  respond  to  the  crisis  with 
meaningful  corrective  measures. 
o   Lack  of  experience  of  principal  members  of 

many,  mainly  small,  firms. 
The  study  reviews  corrective  measures  already 
instituted,  and  suggests  further  actions  to  be  taken  by  the 
broker/dealer  firms,  self-regulatory  organizations,  and  the 


SEC.   The  corrective  measures  detailed  in  the  report  include, 
among  other  things,  increasing  capital  requirements,  revis- 
ing or  proposing  rules  to  strengthen  control  over  securities 
and  to  provide  greater  protection  for  customers'  free  credit 
balances  and  securities  left  with  brokers,  and  taking  measures 
to  provide  the  commission  and  self-regulatory  authorities  with 
more  effective  early  warning  systems. 

Securities  Industry  Study  by  the 
SubcorrLmittee  on  Securities"^ 

During  consideration  of  the  SIPC  legislation,  the 
Committee  on  Banking,  Housing,  and  Urban  Affairs  of  the 
Senate  concluded  that  a  complete  study  of  the  securities  in- 
dustry and  the  securities  markets  should  be  undertaken  by 
its  Subcommittee  on  Securities,   And,  after  authorization  by 
the  Senate  in  June  1971,  the  study  was  launched. 

This  study  is  the  culmination  of  the  analysis  of 
the  information  obtained  from  a  variety  of  sources.   Some 
of  the  information  was  obtained  directly  from  industry  and 
government  sources  including  responses  to  a  detailed  ques- 
tionnaire on  institutional  m.embership  sent  to  104  firms, 
organizations  and  individuals.   Other  informiation  was  ob- 
tained through  holding  18  days  of  hearings  and  receiving 
testimony  from  78  witnesses.   It  includes  also  four  detailed 
case  studies  of  the  operation  of  the  regulatory  system  in 
the  securities  industry. 
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The  contents  of  the  study  give  the  impression  that 
the  sub-ccminittee  tends  to  rely  more  on  the  market  mechanism 
and  its  competitive  nature  than  on  regulations.   It  also 
looks  beyond  the  current  problems  to  the  long-range  struc- 
tural features  of  the  industry,  markets,  and  regulatory 
system. 

The  basic  conclusions  of  the  study  include,  but  are 
not  limited  to,  the  adaptation  of  greater  flexibility  in  the 
industry,  the  redefinition  of  the  division  of  responsibility 
between  SEC  and  self-regulatory  organizations,  the  provision 
of  fair,  orderly,  and  open  procedure  in  reaching  important 
decisions  by  these  organizations,  and  giving  more  weight 
to  competition  than  to  regulation.   In  the  words  of  the  sub- 
committee report: 

...Regulation,  by  government  and  by  industry 
groups,  is  an  essential  element  in  protection 
of  investors,  but  is  not  an  effective  substitute 
for  competition  in  assuring  a  flexible  and 
healthy  industry. 

The  concept  of  industry  self-regulation,  sub- 
ject to  SEC  oversight,  is  well-adapted  to 
dealing  with  problems  of  conduct  and  ethics, 
but  it  is  not  well-adapted  to  dealing  with 
general  economic  questions  involving  competi- 
tive interrelationships  among  firms  within 
the  industry ....  7 

Securities  Industry  Study  by  the  Subcomm.ittee 
on  Commerce  and  Finance^ 

This  study  was  undertaken  in  fulfillment  of  the 
commitm.ent  made  by  the  chairman  of  the  committee  on  Inter- 
state and  Foreign  Commerce  with  respect  to  the  committee's 
intention  to  make  a  complete  study  of  the  securities  market 
and  exchanges. 


The  study  is  largely  the  outgrowth  of  an  extensive 
number  of  panel  hearings ,  witness  testimonies,  views  of 
industry  leaders,  interviews  in  virtually  every  section  of 
the  country,  and  reviews  of  previous  and  current  studies  on 
the  securities  industry. 

It  includes  chapters  on  what  went  wrong  in  the 
securities  business  during  the  period  1967-70,  criminal  en- 
forcement of  securities  laws,  standards  for  entry  into  the 
securities  business,  capital  for  broker/dealers,  protection 
of  customers,  accounting  and  auditing  procedures,  opera- 
tional difficulties,  cooperative  regulations,  central  market 
system,  commission  rates,  membership  qualification  for 
exchanges,  antitrust  aspects,  and  the  SEC  budget. 

Space  does  not  allow  the  enumeration  of  the  large 
number  and  variety  of  conclusions  reached,  but  to  provide  a 
feeling  about  the  nature  of  this  study,  a  sentence  from  the 
related  letter  of  transmittal  signed  by  J.E.  Moss,  Chairman 
of  the  subcommittee,  is  quoted  here. 

This  study  is,  to  my  knowledge,  the  only 

comprehensive  congressional  examination 
of  the  securities  industry  to  have  been 
completed  within  the  time  span  of  one 
Congress .... 

SIPC  Study^ 

SIPC  has  also  made  inquiries  of  the  trustees  and 
reviewed  the  reports  of  the  staffs  of  the  self-regulatory 
organizations  participating  in  the  cases  of  the  94  broker/ 
dealer  firms  liquidated  as  of  December  31,  1973,  in  an  effort 
to  determine  the  cause  of  failure. 
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The  study  indicates  that  the  matter  of  possible  fraud 
and  manipulation  which  has  surfaced  in  many  of  these  liquida- 
tions must  be  recognized  as  a  major  factor  in  these  failures. 
Large  concentrations  of  speculative  issues  in  the  trading 
activities  contributed  to  other  failures.   Inadequate,  in- 
accurate, and  sometimes  nonexistent  books  and  records  were 
another  significant  factor.   In  many  cases,  the  operating 
management  did  not  have  the  qualifications  or  experience 
needed  to  operate  a  general  securities  business.   Lack  of 
adequate  capital  continues  to  be  mentioned  by  the  trustees 
as  a  major  factor  in  firm  failures.   According  to  the  study, 
this  condition  results  from  a  number  of  factors  including  too 
small  a  capital  base,  temporary  illiquidity,  over-commitment 
in  particular  issues,  inability  to  absorb  an  adverse  market 
movement,  too  rapid  expansion,  improper  controls,  and  operat- 
ing losses  due  to  reduced  volume.   The  study  states  that 
there  was  over-reliance  on  subordinated  capital  in  a  number 
of  instances.   In  other  cases,  the  subordination  agreement 
was  allegedly  improperly  executed,  fraudulently  induced  or 
improperly  withdrawn  placing  the  firm  in  net  capital  viola- 
tion. 

The  Early  Warning  Requirement  of  the  SIPC  Act 

Section  5(a)(1)  of  the  SIPC  Act  requires  that  the 
commission  or  the  self-regulatory  organizations  notify  SIPC 
immediately  upon  discovery  of  facts  which  indicate  that  a 
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broker  or  dealer  subiect  to  their  regulations  is  in  or 
iipproaching  financial  difficulty. 

After  recei^'ing  such  notification,  SIPC  detex-ir.ines 
whether  to  apply  for  a  court  decree  to  initiate  liquidation 
or  to  defer  action  until  further  information  is  obtained  and 
doubt  IS  removed. 

There  are  five  conditions  specified  in  Section  5(h) 
of  the  Act.   At  least  one  of  these  conditions  must  be  found 
to  exist  by  SIPC  to  apply  for  a  court  decree  and  by  the  court 
to  issue  the  decree  in  every  case.   The  court  shall  issue  a 
decree  if  it  finds  that  the  m.ember  firm: 

a.  Is  insolvent  within  the  meaning  of  Section  1(19) 
of  the  Bankruptcy  Act,  or  is  unable  to  meet  its  obligations 
as  they  mature,  or 

b.  Has  committed  an  act  of  bankruptcy  V7ithin  the 
meaning  of  Section  3  of  the  Bankruptcy  Act,  or 

c.  Is  the  subject  of  a  proceeding  pending  in  any 
court  or  before  any  agency  of  the  United  States  or  any  state 
in  which  a  receiver,  trustee,  or  liquidator  for  such  member 
has  been  appointed,  or 

d.  Is  not  in  compliance  with  applicable  requirements 
under  the  193  4  Acts  or  rules  or  regulations  of  the  comjr.ission 
or  any  self-regulatory  organizations  with  respect  to  financia] 
responsibility  or  hypothecation  of  custom.ers '  securities,  or 
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e.   Is  unable  to  make  such  computations  as  may  be 
necessary  to  establish  complaince  v-zith  such  financial  respon- 
sibility or  hypothecation  rules  or  regulations. 

As  was  mentioned  above,  the  SIPC  Act  requires  that, 
when  it  appears  to  the  SEC  or  any  self-regulatory  organiza- 
tion that  a  broker-dealer  is  in  or  approaching  financial 
difficulty,  SIPC  is  to  be  notified  immediately.   The  statute, 
however,  does  not  define  "financial  difficulty."   Neither  has 
it  seemed  practicable  for  SIPC  to  attempt  to  define  these 
terms.   In  view  of  the  number  of  self-regulatory  organiza- 
tions involved  and  the  differences  in  their  rules,  procedures, 
problems,  and  the  manner  in  which  the  various  reporting  and 
surveillance  systems  operate,  SIPC  has  considered  it  approp- 
riate, up  to  this  point  at  least,  to  work  with  each  organiza- 
tion separately. 

The  regulators  fulfill  the  Act's  requirement  through 
regulatory  procedures  which  integrate  inspection  and  report- 
ing programs  with  an  early  warning  procedure  for  notifying 
SIPC.   The  primary  objective  of  these  programs  is  the  early 
identification  of  those  members  which  are  in  or  approaching 
financial  or  operational  difficulty  and  the  initiation  of 
action  necessary  to  protect  customers. 

The  Existing  Early  Warning  Systems 

All  over  the  entire  securities  industry  emphasis  is  now 
on  the  early  warning  systems  by  which  the  financial  difficulty 
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of  the  broker/doalors  is  to  bo  detected  in  advance.   The  two 
main  purposes  of  such  early  warning  systems  are:   (1)  to 
afford  a  self-regulatory  organization  an  opportunity  to 
initiate  procedures  so  that  the  broker/dealer  in  question 
does  not  become  a  SPIC  casualty,  and  (2)  to  provide  a  basis 
for  notification  to  SIPC.   Inasmuch  as  these  warning  systems 
are  directly  related  to  the  subject  matter  of  the  present 
study,  further  review  of  the  related  material  seems  approp- 
riate. 

The  early  warning  system  of  the  SEC. "''-'"   The  SEC,  in 
discharging  its  duties,  periodically  reviews  through  on-site 
inspections  and  in-house  studies  the  early  warning  surveil- 
lance tools  of  the  self-regulatory  organizations  to  ensure 
that  they  constitute  sound  and  effective  programs. 

The  commission's  early  warning  and  surveillance  tools 
include  Rule  17a-ll,  Rule  17a-5(j),  and  Rule  17a-10.   Rule 
17a-ll  was  adopted  to  provide  the  comjnission  and  the  self- 
regulatory  authority  with  an  adequate  and  timely  flow  of 
information  on  the  financial  and  operational  condition  of 
broker/dealers.   Among  other  provisions,  it  requires  a 
broker/dealer  to  notify,  by  telegram,  the  commission  and  the 
appropriate  self-regulatory  organization  if  it  breaks 
through  certain  specified  financial  or  operational  parameters. 
Rule  17a-5(j)  requires  a  broker/dealer  to  notify  the  commis- 
sion if  its  exchange  mcombership  is  terminated  and  becomes 
subject  to  the  commission's  net  capital  rule.   Rule  17a-10 
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requires  a  broker/dealer  to  file  Form  X-17A-10  annually  with 
the  commission. 

An  additional  financial  responsibility  rule,  Rule 
15c3-3,  titled  "Customer  Protection--Reserves  and  Custody  of 
Securities,"  became  effective  on  January  15,  1973.   Failure 
to  comply  with  the  requirements  of  this  rule  has,  in  a 
number  of  instances,  been  the  basis  of  the  statutory  condi- 
tion. 

In  addition  to  these  rules,  the  commission,  working 
together  with  the  NASD  and  the  self-regulatory  authorities, 
has  developed  a  monthly,  combined  early  warning  list  and  a 
Rule  17a-ll  list  for  each  regional  office.   Each  month  the 
self -regulatory  organizations  submit,  for  the  combined  list, 
data  pertaining  to  the  financial  condition  of  troubled  firms 
within  each  commission  region.   The  list  is  then  speedily 
transmitted  to  each  regional  office  chief  examiner  for  his 
verification  within  24  hours. 

Finally,  in  many  cases,  the  commission's  application 
for  an  injunction  and  the  appointment  of  a  receiver  has 
typically  been  based  on  violations  of  the  net  capital  rules 
or  such  inadequacy  of  books  and  records  that  the  firm  is  un- 
able to  make  such  computations  as  may  be  necessary  to 

establish  compliance  with  the  rules  concerning  financial 
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responsibility  or  hypothecation  of  customers'  securities. 

The  early  warning  system  of  the  self-regulatory 
organizations .   Over  the  years,  the  exchanges  and  the  NASD 
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each  has  developed  a  complex  system  of  surveillance  both  on 
its  own  initiative  and  in  compliance  with  Federal  laws.   As 
a  result,  many  different  early  warning  systerus  have  emerged. 
Space  does  not  allow  a  review  of  all  of  these;  systems  indi- 
vidually, here.   It  may,  however,  be  relatively  safe  to 
generalize  that  a  self-regulatory  organisation  places  a  mem- 
ber firm  under  a  special  surveillance  if  its  net  capital  is 
deficient,  the  ratio  of  aggregate  indebtedness  to  net  capital 
exceeds  the  related  rules  or  guidelines,  or  if  the  firm's 
operating  losses  in  a  given  period  exceed  certain  criteria. 
As  an  example,  the  New  York  Stock  Exchange  has  the  following 
early  warning  criteria  for  carrying  organizations: 

.1.   Rule  325  Capital  Ratio  at,  or  in  excess  of 

1,000  percent--as  of  the  date  of  the  Exchange's 
Early  Warning  Report. 

2.  Cumulative  losses  during  the  most  recent  three- 
month  period  equal  to,  or  in  excess  of,  50  per- 
cent of  the  organization's  tentative  net  capital, 

3.  Any  firm  likely  to  begin  liquidation  within  the 
next  six  m^onths  because  of  losses,  or  scheduled 
withdrawals  of  capital,  which  would  jeopardize 
their  financial  viability. 

The  final  point  to  be  noted  relates  to  the  case  of 
NASD.   The  organization  has  recently  attempted  to  add  a  new 
and  significant  feature  to  its  operating  early  warning  system, 
Due  to  the  similarity  that  exists  between  this  feature  and 
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the  theme  of  the  present  study,  it  will  be  discussed  in  the 
following  section  in  more  derail.   At  this  point,  however, 
suffice  it  to  say  that,  v;hile  all  the  previously-reviewed 
studies  and  the  warning  procedures  base  their  analysis  on 
the  individual  effects  of  a  criterion  in  a  selected  set  of 
criteria,  the  NASD ' s  additional  feature  and  the  present  work 
consider  the  collective  effects  of  such  factors.   The  two 
approaches  m.ay  bring  about  different  solutions  to  the  same 

problem.   But,  the  superiority  of  the  second  approach  has 

14 
been  well  established. 

As  an  example,  consideration  of  a  violation  of  the 
net  capital  rule  on  the  basis  of  its  individual  effect  may 
lead  to  a  different  action  from  that  which  may  be  induced 
when  its  effect  is  considered  collectively,  m  combination 
v/ith  some  other  factor  (s)  such  as  net  income,  for  example. 
As  once  noted  by  SIPC,  although  in  a  little  different  con- 
text , 

...A  violation  of  the  net  capital  rule  m.ight 
not  portend  as  serious  a  situation  from  the 
point  of  view  of  customer  protection  as 
originally  feared.   This  rule  basically  is  a 
test  of  liquidity  as  of  a  particular  time. 
It  does  not  necessarily  follow  that  a  tem- 
porary or  possibly  inadvertant:  failure  to 
comply  with  the  net  capital  rule  makes  losses 
to  customers  inevitable. ^5 

The  NASD  Failed  Firms  Project 

When  the  present  study  was  half-way  through,  it  was 
found  out  that  NASD  has  also  undertaken  a  similar  study,  but 


it  has  not  been  published  yet.  '   It  was  conducted  by  E.J, 
Dervan  and  E.I.  Altman.   It  applies  the  Multivariate  Dis- 
criminant Analysis  (MDA)  technique  as  a  tool  for  predicting 
the  failure  of  firms. 

The  Failed  Firms  Project  uses  a  sample  of  two  group 
firms:   The  Failed  Group  included  40  firms  which  were  under 
SIPC  liquidation;  the  Active  Group  consisted  of  113  firms. 
None  of  the  firms  was  a  member  of  the  New  York  Stock 
Exchange  or  American  Stock  Exchange.   The  financial  data 
on  these  firras  were  obtained  from  the  related  Annual  Finan- 
cial Reports,  Form  X-17A-10. 

The  project  had  been  launched  to  determine  if  the 
reported  financial  experience  of  failed  firms  prior  to 
their  failure  could  be  utilized  to  establish  an  early  warn- 
ing system  for  identifying  potential  failures.   For  this 
purpose,  it  was  reported  that  a  total  of  82  financial  ratios 
and  non-financial  indicators  such  as  age  and  type  of  organiza- 
tion were  exam.ined.   The  following  six  variables  were  selected 
as  doing  the  best  job,  collectively,  in  discriminating  be- 
tween active  and  failed  firms: 

1.  Net  Income  After  Taxes/Total  Assets. 

2.  (Total  Liabilities  +  Subordinated  Loans) /Owner ' s 
Equity . 

3.  Total  Assets/Adjusted  Net  Capital. 

4.  (Ending  Capital-Capital  Additions) /Beginning  Capital. 


5.  Scaled  Age. 

6.  Composite  (based  on  the  ten  separate  variables 
treated  in  a  dichotonous  way) . 

Among  conclusions  of  the  report  were  that  a  majority 
of  failures  arise  from  a  relatively  small  subset  of  NASD 
member  firms;  the  constructed  model  proved  to  be  highly  accu- 
rate by  classifying  90.1  percent  of  an  original  sample  of 
failed  and  active  firms;  the  model  is  highly  reliable,  and 
it  appears  to  provide  significant  lead  time  in  which  to 
initiate  corrective  action. 

The  discussion  about  the  results  of  the  NASD  study 
and  their  comparison  v/ith  those  of  the  present  study  is 
partly  relegated  to  the  next  section  and  partly  to  Chapter 
IV,  where  the  results  of  this  study  are  presented  in  detail. 

Differences  Between  the  NASD  Study 
and  the  Present  Work 

Although  the  two  studies  are  similar  in  a  number  of 
ways,  including  the  application  of  the  MDA  technique,  they 
differ  from  each  other  in  analytic  fram.ework,  source  of 
data,  data  set,  and  results.   More  specifically,  the  dif- 
ferences lie  in  the  following  areas: 

The  Analytic  Framework 

The  NASD  study  uses  the  traditional  approach,  which 
may  be  dubbed  "deterministic"  to  differentiate  from  the 
"probabilistic"  approach  being  presented  in  the  following 
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section,  in  building  up  the  prediction  model.   Whereas,  the 
approach  taken  in  the  present  study  is  a  blend  of  "determinis- 
tic" and  "probabilistic"  approaches.  This  point  will  become 
more  clear  when  the  following  section  on  "Different  Ap- 
proaches" is  reviewed. 

The  Variable  Sets 

Although  the  complete  list  of  the  mentioned  8  2 
variables  in  the  NASD  study  has  not  been  reported,  it  is  most 
likely  that  the  two  variable  sets  used  in  building  up  the 
models  differ  from  one  another.   The  difference  is  partially 
due  to  the  introduction  of  the  "probabilistic"  variables  in  the 
present  study,  such  as  standard  deviation  of  some  financial 
ratios  and  a  version  of  Tchebycheff ' s  inequality  for  estimat- 
ing the  probability  that  the  value  of  some  variables  fall 
below  a  disaster  level,  and  partly  due  to  the  sources  of 
data  which  will  be  discussed  below. 

The  Sources  of  Data 

Form  X-17A-10  was  the  source  of  data  for  the  original 
model  of  the  NASD  study.  This  form  contains  most  of  the 
financial  data  which  are  ordinarily  included  in  Income  State- 
ment and  Balance  Sheet.  The  present  study,  however,  has  used 
inform.ation  contained  in  Form  X-17A-5.  This  form  contains 
only  items  which  are  ordinarily  included  in  Balance  Sheet  and 
lacks  any  income-statement  item.   The  choice  of  the  source 
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for  the  present  study  was  inevitably  due  to  the  inavalibilitv 
of  Form  X-17A-10  to  the  public. 

Data  Set 

In  the  NASD  study  only  an  original  sample  has  been 
employed  and  it  has  consisted  of  40  failed  firms  and  113 
non-failed  firms.   In  the  present  study,  two  xndependent 
samples,  i.e.,  the  original  sample  and  the  holdout  sample, 
are  used;  each  sample  has  30  failed  firms  and  30  non-failed 
firms. 

The  Results 

It  is  very  difficult,  if  not  impossible,  to  compare 
the  results  of  the  two  studies.   The  first  reason  is  the  lack 
of  the  very  important  financial  elements,  i.e.,  the  income- 
statement  items,  in  the  source  of  data  of  the  present  study. 
The  second  reason  is  the  absence  of  similar  counterparts  in 
the  two  studies,  especially  with  respect  to  the  model  which 
predicts  the  bankruptcy  two  years  in  advance.   The  third,  but 
not  the  last,  is  the  lack  of  the  holdout  sample  in  the  NASD 
study  which  might  have  been  due  to  the  nonexistence  of  suffi- 
cient numbers  of  observations  at  the  time  of  the  study. 
Given  these  limitations,  a  comparison  will  be  made  in  Chapter 
IV,  wherever  it  is  feasible. 
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Different  Approaches 

In  predicting  the  financial  position  of  a  firm  in  a 
near  future,  there  are  at  least  three  potential  approaches: 
The  first  one,  which  has  already  been  put  to  use,  is  finan- 
cial ratio   analysis,  previously  on  the  univariate  basis  and 
recently  on  the  multivariate  scale.   The  approach  may  be 
dubbed  deterministic.   The  second  one,  which  has  not  yet  been 
developed  is  a  probabilistic  approach.   The  reasons  for  con- 
sidering such  an  approach,  along  with  a  brief  illustration  of 
the  concept,  will  be  given  in  the  following  paragraphs.   Then, 
the  difficulties  associated  with  its  implementation  are  dis- 
cussed.  As  a  consequence,  the  third  potential  approach, 
which  is  a  compromise  between  the  first  and  second,  emerges. 
This  last  approach  is  the  one  which  has  been  adopted  in  the 
present  study. 

The  Deterministic  Approach 

18 
In  the  early  years  of  the  1930 's.  Smith  and  Winakor 

undertook  a  study  of  133  financially  troubled  companies  and 

concluded  that  Working  Capital/Total  Assets,  Net  Worth/Total 

Debt,  and  Net  Work/Fixed  Assets  were  effective  indicators  of 

financial  difficulties  in  firms.   In  19'31,  in  another  study, 
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Fitzpatrick    employed  13  different  ratios  and  concluded  that 

all  the  ratios  predicted  failure  to  an  extent,  and  that  the 
best  predictors  were  Net  Worth/Total  Debt  and  Net  Profit/Net 
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Worth.   Similar  efforts  were  later  made  by  Merv/in,    Tamari,  •'- 
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and  Beaver.     But,  none  of  these  researchers  came  to  the 

same  conclusion.   What  they  had  in  common,  however,  was 
their  approach  to  the  problem  which  was  based  on  the  uni- 
variate analysis  of  the  financial  ratios. 
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Later,  in  19  67,  Altman    improved  upon  the  above 

approach  by  taking  advantage  of  the  properties  of  the  multi- 
variate discriminant  analysis  technique.   This  technique, 
which  drastically  reduces  the  analyst's  space  dimensionality 
and  automatically  assigns  appropriate  weights  to  appropriate 
variables,  removed  some  of  the  ambiguities,  confusions,  and 
difficulties  in  the  interpretation  of  the  traditional  ratio 
analysis.   Since  then  this  approach  has  been  utilized  in 
predicting  the  financial  position  of  firms  in  various 
industries . 

Although  the  recent  method  is  both  theoretically  and 
operationally  superior  to  the  traditional  ratio  analysis,  it 
still  shares  some  drawbacks  with  the  latter:   Both  methods 
treat  the  process  in  such  a  way  as  though  the  property  of 
cause  and  effect  relation  perfectly  holds  in  a  deterministic 
manner.   The  implied  assumption  is  that  a  specific  stimulus 
or  a  set  of  stimuli  (i.e.,  changing  the  financial  ratios) 
produces  a  specific  response  from  the  firm  (i.e.,  failure). 
And,  the  required  effort  to  predict  the  firrri's  state  is 
solely  to  identify  such  stim.uli.   Furthermore,  it  is 
implicitly  assumed  that  the  firm  is  in  isolation  and  no 
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external  influence  (such  as  changes  in  the  market  and  eco- 
nomic conditions)  affects  thG  future  financial  position  of 
the  firm.   In  other  words,  the  implied  assumption  is  that 
the  state  of  the  firm  at  some  time  in  the  future  is  determin- 
istically  ascertained  by  its  current  and  past  states. 

In  the  face  of  widely  prevalent  uncertainty  around 
every  aspect  of  the  business  world,  it  is  obvious  that 
reliance  on  such  far-out-of-reality  assumptions  is  not 
plausible.   In  fact,  the  firm  is  a  system  that  can  be  re- 
sembled to  a  "grey  box,"  about  which  we  have  only  a  limited 
knowledge.   The  numerous  subsystems,  including  each  indi- 
vidual, with  his  own  capricious  and  ever-changing  charac- 
teristics, constituting  the  firm,  and  numerous  systems  and 
subsystems  building  up  the  socio-economic  environm.ent  in 
which  the  firm  exists  makes  any  certain  inference  or  pre- 
diction about  the  firm's  behavior  very  difficult,  if  not 
impossible.   The  complexity  of  actions  and  interactions 
between  these  many  systems  and  subsystems  are  such  that  we 
may  never  be  able  to  assert  with  certainty  that  a  specific 
input  to  the  firm  yields  a  specific  output.   Under  such 
conditions,  the  best  way  that  we  may  expect  is  a  vector  of 
possible  outcomes  (rather  than  a  single . certain  outcome) 
with  somie  probabilities  attached  to  each  of  them. 

The  Probabilistic  Approach 

This  approach,  which  has  not  been  developed  yet,  is 
based  upon  the  same  concept  which  drove  the  statistician  to 
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replace  the  complexity  by  uncertainty.   The  statistician  who 
knew  that  in  tossing  a  coin  the  prediction  of  the  ultimate 
position  of  the  coin  is  conceivably  possible  by  using  the 
laws  of  mechanics  (given  sufficient  information  about  its 
initial  position,  the  way  in  which  it  is  thrown,  and  the 
specifications  of  the  surface  on  which  it  falls)  concluded 
that  it  is  far  too  difficult  a  problem  for  contemporary  science 
to  be  solved  in  this  way.   He  therefore  suggested  a  more  op- 
erational way  of  predicting  the  result  of  the  tossing — the 
0.5  probability  of  getting  heads  if  the  coin  is  a  fair  one. 

Yet,  the  prediction  of  the  firm's  financial  position 
is  not  as  simple  as  that  of  a  fair  coin.   The  firm's  posi- 
tion is  a  combinatorial  phenomenon  which  is  the  result  of  yet- 
unexplored  special  combinations  of  numerous  internal  and  ex- 
ternal (to  the  firm)  factors,  each  of  which  has  a  special 
probability  distribution  of  outcome.   The  concept,  neverthe- 
less, is  straightforward. 

To  illustrate,  suppose  that  the  probability  of  the 
firm's  financial  distress  is  affected  by  only  three  proba- 
bilistic factors  A,  B,  and  C,  whose  a  priori  probability 
distributions  may  be  estimated  through  past  experiments.   If 
these  three  factors  were  mutually  exclusive,  the  probability 
of  financial  distress  would  equal  to  the  probability  of  get- 
ting any  one  of  these  three  events.   That  is: 

P(FD)  =  P(A  or  B  or  C)  =  P(A)  +  P(B)  +  P(C)       (1) 
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If  A,  B,  and  C  are  not  mutually  exclusive,  then  equation  (1) 
would  not  hold.   Depending  ur.on  the  anticipated  influence  of 
these  factors  on  the  probability  of  financial  distress,  there 
are  some  other  choices  available.   If  we  have  reasons  to 
believe  that  each  factor  has  effects  individually ,  the  more 
relevant  equation  may  be  the  following: 

P(FD)  -^-    P{A  or  B  or  C)  =  P  (A)  +  P{B)  +  P(C)  -  P  (A  &  B) 
-  P(B  &  C)  -  P(A  &  C)  (2) 

On  the  other  hand,  if  the  available  evidence  suggests  that  the 
effect  comes  through  the  joint  occurrence  of  all  of  the  three 
factors,  then,  equation  (2)  does  not  explain  the  case.   If 
A,  B,  and  C  are  statistically  independent,  the  proper  equa- 
tion may  be  the  following: 

P(FD)  =  P(A  &  B  &  C)  =  P(A)P(B)P(C)  (3) 

Otherwise,  equation  (4)  may  be  more  relevant: 

P(FD)  +  P(A  &  B  &  C)  =  P(A)P(b!a)P(C|A  &  B)        (4) 

The  crux  of  the  problem  lies  in  the  fact  that  we  know 
neither  all  of  the  probabilistic  factors  affecting  the 
probability  of  financial  distress,  nor  the  relationships  be- 
tween such  factors.   While  some  of  these  factors,  such  as 
long-term  debt  and  short-term  debt,  may  be  mutually  exclu- 
sive, other  factors  such  as  revenues  from  different  product 
lines  may  not  necessarily  be  so,  and  still  other  factors 
such  as  cash-flow  and  expenses  may  be  statistically  dependent. 
Furthermore,  it  is  conceivable  that  the  conditional 
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probabilities  of  two  factors  may  not  be  equal,  i.e.  P(a1b)  7^ 
P(BJA).   Such  a  case  may  arise  when  A  represents,  for  ex- 
ample, the  revenue  of  the  firm  and  B  the  general  level  of 
economic  activities.   Thus,  the  relevant  probability  equa- 
tion of  the  firm  cannot  be  a  simple  equation  such  as  any  one 
of  those  mentioned  above.   Conversely,  it  is  likely  that  the 
function  is  a  corrLbination  of  a  great  number  of  such  equations, 
embracing  a  v/ide  range  of  relevant  factors. 

The  Blend  Approach 

Although  the  probabilistic  approach  seems  to  be 
intriguing,  it  requires  a  great  deal  of  research  before  it 
can  be  actually  put  to  work.   More  specifically,  the  research 
needed  comprises:   (1)  identifying  all  the  probabilistic 
factors  affecting  the  probability  of  the  firm's  financial 
distress.   (2)  determining  the  inter-relationships  between 
these  factors.   (3)  finding  a  suitable  V7ay  to  estimate  the  a 
priori  probabilities  of  such  factors,  and  finally  (4)  for- 
mulating the  probabilistic  model. 

Faced  with  such  a  huge  work,  one  may  think  about  the 
ways  of  reducing  the  load.   One  way  to  do  this,  is  to  bridge 
between  the  deterministic  and  the  probabilistic  approach.   And 
this  is  the  path  taken  in  the  present  study. 

While  the  blend  approach  retains  all  the  favorable 
properties  of  the  deterministic  approach,  especially  with 
respect  to  the  utilization  of  the  MDA  technique,  it  also 
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incorporates  some  facets  of  the  probabilistic  approach.   Such 
a  midway  selection  may  ultinately  be  helpful  in  the  develop- 
ment of  the  potentially  more  comprehensive  probabilistic 
approach . 

The  Purpose  of  the  Study 

The  purpose  of  this  study  is  three-fold: 

1.  To  identify  potential  econom.ic  and  non-economic 
factors  which  might  differentiate  between  finan- 
cially distressed  securities  brokerage/dealer 
firm.s  and  those  which  appear  to  be  economically 
viable. 

2.  To  statistically  determine  a  set  of  variables, 
from  amiong  the  factors  in  point  1,  which  has  the 
highest  discriminatory  power. 

3.  To  suggest  both  public  and  managerial  action 
which  might  be  taken  in  light  of  the  statisti- 
cal results  obtained  in  point  2. 

The  Problem  Encountered 

When  the  work  on  the  present  study  had  been  initiated, 
it  was  hoped  that  no  serious  problem  would  be  encountered  with 
respect  to  the  data  needed  to  test  the  suggested  hypothesis. 
Unfortunately,  however,  when  the  work  on  the  theoretical  part 
of  the  dissertation  was  finished,  and  the  time  for  collecting 
the  data  came,  the  hope  was  replaced  by  regret. 
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The  Public  Reference  Office  en  the  SEC,  although  co- 
operative in  supplying  the  financial  Forms  X-17A-5  of  the 
selected  companies,  denied  the  accessibility  of  Forms  X-17A- 
10,  on  the  ground  that  they  were  confidential. 

Since  Form  X-17A-5  contains  only  items  similar  to 
those  of  Balance  Sheet  and  lacks  about  half  of  the  highly 
valuable  financial  inform.ation  which  is  contained  in  Form 
X-17A-10,  especially  with  respect  to  the  revenue  and  expense 
items,  a  full  test  of  the  suggested  hypothesis  is  not  pos- 
sible with  the  available  data.   It  is  hoped  that  in  evalua- 
tion of  the  results  of  the  study,  this  limitation  is  kept  in 
mind.   The  results  might  have  been  better  if  all  the  finan- 
cial information  could  have  been  included  in  the  analysis. 

Th e  Org anization  of  the  Dissertation 

After  the  introductory  chapter,  the  analytic  frame- 
work is  set  forth  in  Chapter  II  which  discusses  the  suggested 
hypothesis  and  determines  the  potential  factors  affecting 
the  probability  of  financial  distress  by  expanding  the 
hypothesized  function.   Included  in  this  chapter  is  a 
section  on  risk  which  explores  the  possible  measures  of 
risk  and  risk  factors. 
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Chapter  III  provides  a  groundwork  for  the  use  of  the 
MDA  technique  in  the  analysis.  It  presents  an  overall  pic- 
ture of  the  MDA  technique  in  application  in  a  suminary  form. 

Chapter  IV  integrates  the  materials  included  in 
Chapters  II  and  III  into  an  empirical  experiment  consisting 
of  Test  I  and  Test  II.   This  chapter  describes  the  sources 
of  data  and  data  set,  the  procedure  used  in  selecting  samples, 
and  the  research  methodology  em.ployed  to  em.pirically  test  the 
hypothesis.   The  greater  part  of  this  chapter  is  devoted  to 
Test  I. 

Finally,  Chapter  V  comprises  a  summary  of  the 
dissertation,  the  conclusion  reached  by  the  empirical  ex- 
periment, the  implication  of  the  results  found,  and  the 
suggestions  for  further  research. 
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CHAPTEx''.  II 
THE  GENERAL  ANALYTIC  FRAMEWORK 

Introduction 

This  chapter  presents  an  analytic  framework  for 
identifying  potential  determinants  of  a  firm's  financial 
distress.   The  analytic  framework  is  presented  here  in  a 
general  form,  potentially  applicable  to  all  profit- 
oriented  private  organizations.   Its  application  to  secu- 
rities brokerage/dealer  firms  will  be  discussed  in  Chapter 
IV. 

The  analysis  begins  with  the  hypothesis  that  the 
probability  of  financial  distress  of  a  firm  is  a  function 
of  net  casli-flow  per  dollar  of  assets,  risk  exposure,  and 
mismanagement  of  the  firm.   By  decomposing  the  cash-flow 
equation,  internal  and  external  factors  affecting  revenue 
and  expenses  are  examined  in  some  detail.   Risk  is  then 
defined,  and  after  discussion  of  several  measures  of  risk, 
risk  factors  associated  with  different  facets  of  a  firm 
are  explained.   Finally,  a  brief  discussion  is  given  of 
factors  comprising  the  "mismianagement"  element  in  the 
hypothesized  function. 
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The  analysis  results  in  a  widely  expanded  general 
function  that  contains  a  large  number  of  factors,  each  of 
which  possibly  affects  the  probability  of  a  firm's  finan- 
cial distress.   What  is  needed,  however,  is  a  parsimonious 
function  which  contains  only  a  fev?  important  factors  with 
specific  relations  among  them.   Such  a  desired  specific 
function  may  be  then  derived  from  the  expanded  general 
function  by  utilizing  the  multivariate  discriminant  analy- 
sis described  in  Chapter  III. 

The  Hypothesis 

Every  industry  has  its  own  problems  and  peculiari- 
ties.  So  does  each  firm  within  each  industry.   Machine 
tool  companies  suffer  from  significant  svi/mgs  in  scales, 
steel  companies  undergo  labor  strikes,  and  oil  companies 
are  highly  dependent  on  geological  discoveries.   Food 
companies  demonstrate  sales  stabilities,  public  utilities 
are  regulated  by  governments,  and  business  machine  com- 
panies are  characterized  by  grov/th. 

In  such  diverse  surroundings  any  specific  analytic 
fram.ework  for  identifying  potential  determ.inants  of  a 
firm's  financial  distress  which  fits  a,   certain  industry 
will  not  be  of  much  help  to  another  industry.   What  is 
needed,  then,  is  a  general  analytic  framework  which  may 
be  applicable  to  any  profit-oriented  private  organization, 
large  or  small,  manufacturing  or  servicing. 
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The  effort  undertaken  in  this  chapter  hopefully 
will  meet  such  a  need.   It  is  necessary,  however,  to  treat 
most  factors  as  being  equally  important.   This  is  due  to 
the  fact  that  a  factor  which  is  relatively  less  important 
in  one  industry  may  be  of  greater  importance  in  another 
one.   Such  a  treatment,  however,  may  not  be  hannful,  be- 
cause, in  applying  the  analytic  framework  to  a  specific 
industry,  appropriate  weights  will  be  assigned  to  the  rela- 
ted factors. 

To  start  the  analysis,  it  is  hypothesized  that  the 
probability  of  financial  distress  of  a  firm  at  tim.e  t, 
P{FD  ) ,  is  a  function  of  the  net  cash-flow  per  dollar  of 
assets  at  time  t  (CASHFA  ) ,  the  extent  of  the  risk  assumed 
by  and  imposed  upon  the  firm  (RISK) ,  and  the  mismanagement 
of  the  firm  (MISMGT) . 

P(FD  )  =  f  (CASHFA^,  RISK,  MISMGT)  (1) 

The  inclusion  of  these  elements  in  equation  (1)  is 
the  consequence  of  the  following  line  of  reasoning: 

With  respect  to  the  net  cash-flow  per  dollar  of 
assets,  it  is  both  an  index  of  the  profitability  of  the 
firm  and  that  of  the  availability  of  funds  to  meet  the 
fixed  obligations  in  time.   This  index,  which  has  been 
relieved  from  size  effect  of  the  firm,  can  in  fact  be 
divided  into  two  parts,  the  first  part  being  the  rate  of 
return  on  equity  capital  and  the  second  part  being  a  fund 
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ratio  added  to  or  deducted  from  this  rate  due  to  the  changes 
in  assets  (including  decrease  in  the  amount  of  fixed  assets 
and  increase  in  cash  due  to  the  depreciation  allowances  or 
assets  sales)  and  liabilities. 

The  rate  of  return  on  equity  capital  should  have  a 
direct  effect  on  the  probability  of  financial  distress.   If 
the  firm  has  a  high  rate  of  return  (compared  with  either 
its  cost  of  capital  or  the  market  risk-free  rate  of  inter- 
est) ,  it  has  a  lower  probability  to  become  financially 
distressed  than  otherwise,  other  things  held  constant.   The 
reason  is  that  the  high  rate  of  return  indicates  that  the 
firm,  in  addition  to  wholly  or  partially  sat:isfying  the 
expectations  of  the  equity  owners,  depending  on  the  height 
of  the  rate  and  the  level  of  the  risk,  has  been  able  to 
meet  the  requirements  of  the  owners  of  debt  and  preferred 
capital,  thus  alleviating  situations  which  threaten  the 
existence  of  the  firm.   By  contrast,  a  low  rate  of  return, 
and  in  extreme,  a  negative  rate  of  return,  signifies  im- 
pending inability  of  the  firm  to  meet  its  obligations  and/or 
probable  depletion  of  the  equity  capital  in.  shock  absorbing 
the  adverse  situations.   Such  situations  may  develop  very 
rapidly  in  some  industries  such  as  the  securities  busi- 
ness where  the  value  of  m-ost  assets  is  subject  to  the 
changes  in  m.arkat  and  business  activities  and  the  ratio 
of  equity  to  debt  capital  is  as  high  as  1  to  15.   There- 
fore, the  lower  is  the  rate  of  return  on  equity  capital. 
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the  higher  is  the  probability  of  financial  distress,  other 
things  being  equal. 

Although  the  rate  of  return  on  equity  capital 
indicates  the  profitability  of  the  firm,  there  have  been 
occasions  that  the  profitable  firm  {as  indicated  by  this 
rate)  has  been  unable  to  meet  its  fixed  obligations  in 
time,  due  to  unavailability  of  sufficient  cash  in  its 
account.   On  some  other  occasions  the  reverse  has  been 
observed.   That  is,  a  temporarily  unprofitable  firm  which 
has  had  access  to  sufficient  funds  has  escaped  from  finan- 
cial distress. 

Therefore,  it  is  expected  that  cash-flow  per  dollar 
of  assets  has  a  higher  impact  on  the  probability  of  finan- 
cial distress  than  the  corresponding  rate  of  return  on 
equity. 

Regarding  the  risk  element  in  equation  (1) ,  it  is 
a  complex  factor  which  reflects  the  overall  risk  exposure 
of  the  firm.   It  is  a  combination  of  the  risk  factors 
assumed  deliberately  by  the  firm  itself  and  those  imposed 
upon  it  by  external  forces.   The  factors  range  from  those 
associated  with  liquidity,  borrowing  and  lending,  to 
business  activities,  and  to  changes  in  market  and  the 
economy  in  general.   This  element  will  be  discussed  later 
in  more  detail.   But,  at  this  point,  it  is  necessary  to 
present  the  reason  of  its  inclusion  in  equation  (1) . 
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By  any  relevant  definition  of  risk  in  this  contaxt, 
one  would  expect  to  observe  some  unfavorable,  as  well  as 
favorable,  impact  of  risk  factors  on  the  perfonr.ance  of  the 
firm.   Thus,  it  seems  to  be  relevant  to  relate  the  prob- 
ability of  financial  distress  to  risk.   To  illustrate  the 
point,  consider  two  such  risk  factors,  i.e.,  variance  of 
the  rate  of  return  and  financial  leverage. 

Since  the  income  of  a  firm  is  subject  to  the  changes 
in  the  level  of  general  business  activities  (as  well  as  to 
many  other  factors) ,  its  rate  of  return  is  a  random  vari- 
able.  Therefore,  the  rate  of  return  may  vary  from  a  low 
point  to  a  high  point  over  time.   If  the  range  of  this 
variation  is  very  wide,  other  things  being  equal,  there  is 
a  higher  probability  of  facing  financial  distress,  as  a 
result  of  several  consecutive  occurrences  of  low  rates, 
than  otherwise. 

Similar  reasoning  applies  for  the  dependency  of 
the  probability  of  financial  distress  upon  the  financial 
leverage.   The  owners'  equity  capital  acts  as  a  shock 
absorber  in  a  car.   If  the  equity  capital  constitutes  a 
larger  part  of  the  total  capital  of  the  firir.,  there  is  a 
broader  base  to  support  the  burden  of  adverse  situations. 
That  is,  when  several  consecutive  losses  occur,  the  fixed 
charges  of  debt  capital  and  other  liabilitias  can  be  paid 
through  digging  into  the  equity  capital.   But,  if  the 
equity  portion  of  the  capital  is  not  sufficiently  large, 
such  obligations  cannot  be  met  adequately.  — 
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With  respect  to  the  mismanagement  element  (MISMGT) 
in  equation  (1) ,  the  word  may  not  suitably  explain  the 
purpose.   It  was,  however,  intended  to  eiribrace  and  describe 
all  factors,  such  as  fraud  and  poor  books  and  records, 
which  do  not  fall  within  the  domain  of  the  other  two  ele- 
ments in  the  equation  but  have  some  impact  on  the  prob- 
ability of  the  firm's  financial  distress. 

In  equation  (1)  the  relationships  between  the 
independent  and  dependent  variables  were  considered  only 
in  a  general  form.   It  is,  however,  desirable  to  obtain 
at  least  the  approximation  of  the  true  relations  in  a 
specific  form.   Since  each  of  the  above  independent  vari- 
ables is  under  the  influence  of  numerous  factors  with  some 
possible  com.plex  interactions  among  them,  error  in  com- 
puting the  approximation  through  their  direct  measurements 
(which  is  only  possible  for  CASKFA)  nay  highly  be  magni- 
fied.  It  seems,  therefore,  more  appropriate  to  break  down 
the  general  relations,  as  far  as  possible,  into  several 
simpler  ones.   This  may  be  done  by  expanding  equation  (1) 
in  terms  of  all  factors  which  seemingly  underly  the 
independent  variables.   Then,  measuring  each  of  these 
factors  and  statistically  analyzing  their  joint  effects 
on  the  probability  of  financial  distress  may  lead  to  a 
better  result. 
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Net  Cash-Flow  per  Dollar  of  Assets 

Starting  the  expansion  of  equation  (1)  by  decom- 
posing net  cash-flow  per  dollar  of  assets  at  time  t,  it 
is  known  that,  by  aefinition,  CASHFA   is  the  ratio  of  the 
available  net  cash-flow  at  tirne  t,  ACF  ,  to  corresponding 
assets,  ASSET  .   Thus: 

CASHFA^  =  ACF  /ASSET^  (2) 

Available  cash-flow  itself,  however,  is  determined 

2 

by  several  other  factors.    To  oe  more  exact: 

ACF   =  (NIAS  +  DEP)  -  GASSET  +  GLIAB  (3) 

Where  NIAS  =  Net  income  available  to  shareholders, 
DEP  =  Accounting  depreciation  and  depletion 
allowances  put  aside  from  operating 
income  to  compensate  the  part  of  the 
assets  used  up  in  generating  the 
revenue, 
GASSET = Growth  in  assets,  and 
GLIAB  =  Growth  in  liabilities- 

From  equation  (3) ,  it  is  seen  that  available  cash- 
flow is  directly  related  to  net  income  available  for 
shareholders  as  v/ell  as  to  depreciation  allowances.   An 
increase  in  the  assets,  however,  would  have  a  negative 
effect,  whereas  a  decrease  in  the  assets  has  a  positive 
effect  on  the  available  cash-flow.   In  contrast,  an  in- 
crease in  the  liabilities  would  have  a  positive  effect, 


"—!"»- 


40 


while  a  decrease  in  the  liabilities  has  a  negative  effect 
on  the  available  cash-flow. 

In  order  to  be  able  to  continue  the  analysis,  first 
equation  (2)  is  restated  in  the  following  general  form: 

CASHFA   =  g{NIAS,  EQC,  GASSET,  GLIAB,  DEP)      (4) 

Subsequently  observe  that  net  income  is  a  function  of 
revenue  (REV)  and  expenses  (EXP) . 

NfAS  =  h(REV,  EXP)  (5) 

Revenue 

Revenue  is  the  bloodstream  of  the  firm.   It  is, 
however,  subject  to  numerous  internal  and  external  factors. 
These  factors  may  be  grouped  accordingly  under  INTREV  and 
EXTREV  variables,  respectively,  as  in  equation  (6) . 

P^V  =  i (INTREV,  EXTREV)  (6) 

Internal  factors.   The  elements  of  this  group 
consist  of  factors  which  are  more  or  less  under  control  of 
the  firm.   When  managed  appropriaceiy ,  these  factors  have 
positive  effects  on  revenue,  otherwise,  their  impacts  are 
negative.   Such  factors  include  the  following: 

Number  of  product  lines  (NPL)  .'   As  number  of 
product  or  service  lines  increases,  even  though  they  may 
not  be  complementary,  they  reach  a  broader  market,  thus 
causing  an  increase  in  sales  and  presumably  more  revenue, 
and  vice  versa. 
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Number  of  places  it  occupies  (NOPLOC) .   An  explana- 
tion similar  to  that  made  for  the  num.ber  of  product  lines 
applies  to  the  number  of  places  that  a  firm  occupies. 

Number  of  years  in  the  business  (NYR) .   The  number 
of  years  that  a  firm  has  been  in  the  business  should  play 
a  role  in  its  profitability.   Nothing  can  replace  the 
experience  obtained  in  practice.   An  established  firm  is 
more  likely  to  be  a  profitable  one.   Its  mere  lasting 
existence  is  the  evidence  of  its  being  profitable.   The 
relationship,  however,  may  not  be  linear. 

Research  and  development  (R&D).   In  some  industries, 
research  and  development  programs  play  a  vital  role  in  reve- 
nue generation.   More  importantly,  in  many  instances,  the 
mere  existence  of  a  firm  in  some  industries  depends  on  the 
extent  to  which  it  focuses  on  R&D  acitivites. 

Number  of  patents  (NOPAT) .   Vhe  number  of  patents 
and  franchises  that  a  firm  has  in  its  possession  likely 
affects  revenue.   These  factors  give  a  monopolistic  power 
to  the  firm  having  them,  and  leads  to  potential  exploita- 
tion. 

Cash  availability  (CASHAV) .   Although  availability 
of  cash  is  an  absolute  necessity  to  run  most  types  of 
businesses,  and  even  in  some  instances  taking  advantage 
of  possible  opportunities  depends  on  this  factor,  excess 
cash  bears  an  opportunity  cost.   The  income  which  could 
be  earned  by  putting  the  excess  cash  into  a  productive  use, 
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is  the  associated  opportunity  cost.   Leaving  the  excess 
cash  idle  does  not  contribute  to  the  profit;  rather  it  de- 
ducts from  the  potential  revenue. 

Securities  marketability  (SECMKT) .   Marketability 
of  some  securities  is  a  feature  v;hich  provides  the  pos- 
sibility of  replacing  a  part  of  the  required  cash,  which 
brings  no  direct  yield,  v/ith  securities  such  as  US 
government  bonds,  which  bring  some  additional  yield  at 
practically  no  extra  risk.   If,  however,  the  amount  of  this 
item  is  in  excess  of  what  is  necessary,  in  relation  to  cash 
and  other  assets,  the  cost  of  unripe  selling  to  obtain  cash, 
or  the  differential  yield  forgone  by  not  putting  the  excess 
amount  of  securities  into  a  more  productive  investment,  is 
simply  a  deduction  from  actual  or  potential  revenue, 
respectively . 

Credit  easiness  (CREDIT) .   A  mild  credit  extension 
may  improve  sales,  but  over-easiness  may  bring  about  such 
large  bad-debt  accounts  and  costs  that  it  may  actually 
over-offset  the  increment  revenue. 

Inventory  movement  (Il-r^ENT)  .  Similarly,  a  balanced 
inventory  may  facilitate  sales,  whereas  too  much  investment 
in  this  item,  deducts  from  potential  profit. 

While  the  effects  of  CASH  and  SECMKT  on  REV  are 

negative  after  som.e  point,  the  relations  of  CREDIT  and 
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INVENT  to  REV  are  positive  over  a  range. 
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Promotional  efforts  (PROMOT) .   Another  factor  which 
is  under  control  of  the  firm  and  has  so-e  effect  on  revenue 
is  promotional  efforts.   The  relationship,  however,  is  a 
fuzzy  one.   The  number  of  psychological  and  environmental 
factors  which  act  and  react  simultaneously  is  so  many  that  it 
makes  clear-cut  inferences  too  difficult,  if  not  impossible. 
Yet,  the  large  promotional  expense  incurred  every  year  in 
all  industries  is  an  indication  of  its  positive  effect  in 
practice. 

Accounts  payable  (ACCPAY) .   Accounts  payable  nor- 
mally contribute  to  revenue.   This  is  due  to  the  poten- 
tially better  use  of  the  funds  relieved  by  such  accounts. 
Usage  of  accounts  payable,  however,  is  not  v/ithout  limi- 
tations.  It  can  reach  a  point  beyond  which  its  associated 
costs  over-offsets  its  potential  benefits. 

Debt  capital  (DBTCAP) .   In  the  world  of  existing 
taxes,  the  tax  treatment  of  interest  expenses  encourages 
greater  use  of  debt  capital.   If  the  firm  is  operating 
profitably  and  the  rate  of  interest  on  its  debts  remains 
constant  or  rises  slightly  as  the  debt  is  increased,  the 
higher  is  the  leverage,  the  higher  becomes  the  profit. 
Even  under  the  above  conditions,  however,  debt  may  not 
always  be  profitable.   Protective-covenant  constraints 
such  as  a  minimum,  working  capital,  a  percentage  lim.itation 
of  long-term  debt  to  working  capital,  and  capital  expendi- 
ture restrictions,  which  are  imposed  by  debt  owners,  may 
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have  a  significant  influence  on  the  firm's  profits.   Sev- 
eral theories  have  been  advanced  as  to  the  optimal  level 

4 
of  debt.    The  controversy  about  them  has  not,  however, 

been  solved  yet. 

Debt  ir,ix.   The  optimal  mix  of  short-term  (SHRTDT) 
in  relation  to  long-term  (LONGDT)  financing  is  also  of 
considerable  importance.   This  is  due  to  both  interest 
rate  differential  and  other  factors  such  as  flexibility, 
timing  and  etc. 

There  are  of  course  seme  other  internal  factors, 
but  their  effects  on  REV,  individually  and  perhaps  col- 
lectively, is  likely  to  be  minor,  relative  to  the 
previously-mentioned  factors.   Therefore,  the  above  dis- 
cussion may  be  summarized  in  the  following  equation: 

INTREV  =  j (NPL,  NOPLOC,  NYR,  R&D,  NOPAT ,  CASHAV, 
SECMKT,  CREDIT,  INVENT,  FROMOT,  ACCPAY, 
DBTCAP,  SHRTDT,  LONGDT)  (7) 

The  relative  importance  of  each  of  the  above 
factors  in  the  profitability  of  a  firm  differs  from  one 
industry  to  another  and  within  subdivisions  of  each 
industry.   Therefore,  the  explanatory  variables  for 
describing  the  impacts  of  such  factors  on  the  probability 
of  financial  distress  of  the  firm  should  fit  to  specific 
industry  and  its  subdivision  to  which  the  firm  belongs. 

External  factors.   The  elements  of  this  group 
consist  of  factors  over  which  the  firm  does  not  have  any 
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control,  yet  they  have  some  effects  on  the  revenue  of  the 
firm.  Aniong  these  factors,  general  business  activity  has 
a  special  place. 

General  business  activity  (GBA) .   General  business 
and  eccnoir.ic  activity  is  subject  to  recurrent  but  non- 
periodic  fluctuations,  called  business  cycles,  over  a  long 
period  of  time.   These  fluctuations  occur  in  aggregate 
variables  such  as  prices,  GNP ,  investments,  employments, 
and  income.   The  variables  move  at  about  the  same  time  and 
in  the  same  direction,  but  their  rates  of  change  are  dif- 
ferent.  Although  the  effect  of  these  fluctuations  on  a 
firm  differs  from  one  industry  to  another,  no  firm  remains 
immune  from  their  im.pact.   In  general,  over  any  period  of 
time,  the  revenue  of  a  firm  is  a  function  of  the  level  of 
general  business  activity. 

Interest  rate  (INTRR)  .   The  market  prim.e  rate  of 
interest  has  also  some  effects  on  the  revenue  of  the  firm. 
This,  of  course,  relates  to  the  lending  position  of  the 
firm.   For  a  firm  which  has  a  borrowing  position  the  ef- 
fect of  interest  rates  on  its  performance  comes  through 
expenses.   If  the  firm  is  both  lender  and  borrower  ac.  the 
same  time,  the  effect  comes  through  both  revenue  and 
expense  sides.   The  aet  effect,  then,  may  be  either  posi- 
tive or  negative,  depending  upon  the  net  lending  or 
borrowing  position  of  the  firm. 
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Population  (POPUL) .   Population  is  another  impor- 
tant external  factor.   Not  only  the  number  of  inhabitants 
in  the  related  area  matters;  the  population  mix  has  also 
bearing  on  the  revenue.   A  firm,  which  produces  youth 
products  is  m.ore  profitable  in  an  area  where  the  young 
constitute  a  higher  percentage  of  the  population  than  in 
other  areas,  other  things  being  equal. 

Per  capita  income  (PCINC) .   Per  capita  income  of 
the  inhabitants  in  the  area  where  the  firm  runs  its  busi- 
ness is  another  determxnant  of  the  firm's  revenue.   Per 
capita  income,  which  is  an  index  of  the  purchasing  power 
of  the  people  residing  in  an  area,  differs  from  one  region 
to  another.   The  variation  is  due  to  difference  in  the  mix 
of  industries  and  in  the  prevalence  of  large  cities,  where 
wages  are  higher,  than  in  small  towns  and  rural  areas. 

Competition  (COMP) .   Competition,  if  it  exists, 
diffuses  the  potential  market  power  of  the  firm.   Then, 
the  price  which  it  puts  on  its  products  and,  as  a  con- 
sequence, its  revenue,  is  governed  by  the  market.   In  fact, 
the  effect  of  competition,  which  is  the  foundation  of 
capitalism,  is  to  give  rewards,  in  the  form  of  profits, 
to  the  firms  which  are  able  to  survive,  and  penalties, 
in  the  form  of  losses  or  bankruptcy,  to  those  which  are 
not. 

Competition,  in  a  broad  sense,  embraces  all 
factors  of  production — land  (which  includes  all  natural 
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resources),  capital,  lab':.r,  and  ontrcpreneurshm .   In 
contrast  to  tho  firm's  desire  to  have  a  iTionopoly  position 
in  sellirig  products  and  monopsony  power  in  buying  supplies 
and  services  of  labor,  land,  and  capital,  the  actual  situa- 
tion which  it  faces  may  be  exactly  the  opposite  one.   Ex- 
istence of  labor  unions,  monopoly  in  some  raw  materials, 
branch  banking  systems  in  some  states,  and  exclusively 
governmental  purchases  are  commonplace  facts  which  are  faced 
by  many  firias.   All  these  factors  have  bearings  on  the 
firm's  revenue.   Indeed,  wherever  appropriate,  COMP  factor 
should  be  broken  down  into  the  above  four  dimensions  (cor- 
responding to  the  factors  of  production)  in  evaluating  the 
firm's  position  with  respect  to  competition. 

Regulations  (REGUL) .   Rules  and  regulations  are 
imposed  by  either  governmental  regulatory  agencies  or 
self-regulatory  organizations.   They  are  intended  to  regu- 
late various  aspects  of  the  activities  of  firms  in  dif- 
ferent industries.   These  rules  and  regulations  signifi- 
cantly affect  the  firm's  profit.   Rules  and  regulations 
governing  the  rates  of  public  utility  companies  and  rail- 
roads can  be  mientioned  as  examples.   The  rules  and  regula- 
tions sometimes  go  to  extremes.   In  bankirig,  for  example, 
the  regulations  require  a  relatively  low  marginal  rate  of 
return  on  capital.   Each  industry  has  a  set  of  regulations 
relevant  to  itself.   Therefore,  in  evaluating  the  position 
of  the  firm  in  a  certain  industry  with  respect  to  the 
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prevalent  rules  and  regulations  in  that  industry,  factor 
REGUL  should  be  broken  down  into  an  appropriate  number  of 
dimensions . 

The  above  discussion  can  be  mathematically  ex- 
pressed : 

EXTREV  =  k(GBA,  INTRR,  POPUL,  PCINC ,  COMP , 

REGUL)  (8) 

Expenses 

The  expense  element  (EXP)  in  equation  (5)  may  also 
be  broken  down  into  the  part  originated  by  internal  fac- 
tors (INTEXP) ,  and  that  caused  by  external  factors  (EXTEXP) . 
Thus: 

EXP  =  KINTEXP,  EXTEXP)  (9) 

Internal  factors.   Most  of  the  internal  factors 
affecting  the  overall  expenses  of  a  firm  can  be  categorized 
into  selling  expenses  (SELEXP) ,  general  expenses  (GENEX) , 
and  administrative  expenses  (ADMEXP) .   Although  all  firms 
have  these  three  types  of  expenses,  the  relative  importance 
of  each  type  differs  from  one  industry  to  another.   Even  in 
the  same  industry  the  percentage  of  each  Lype  differs 
among  firms.   This  very  difference  may  differentiate  between 
firms  operating  optimally  or  near  optimally  and  those  which 
are  run  sub-optimally.   In  fact,  there  can  always  be  found 
one  or  more  expense  items  which  are  conspicuous  in  a 
specific  industry.   As  an  example,  the  comm>unicat ion 
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expenses  of  firms  in  the  securities  industry  is  prominent. 
Evaluation  of  a  firm's  j:)Osition  with  respect  to  such  an 
item  may  reveal  some  characteristics  of  the  firm's  per- 
formance.  To  obtain  such  items,  it  is  useful  to  break  down 
the  above  three  types  of  expenses  into  more  detailed  com- 
ponents peculiar  to  each  industry.   In  any  event,  deprecia- 
tion expenses,  which  may  belong  to  one  or  more  of  the  above 
categories,  should  be  singled  out  for  calculating  the  net 
cash-flow,  as  stated  in  equation  (3).   To  summarize  the 
above  discussion: 

INTEXP  =  m(SELEXP,  GENEXP ,  ADMEXP)  (10) 

External  factors.   Although  a  little  fuzzy,  all 
external  factors  which  affect  a  firm's  profit  other  than 
through  the  revenue  side  fall  in  this  category.   One  of 
the  most  important  elements  in  such  factors  is  taxes. 

Taxes  (TAX) .   Aside  from  the  economic  purposes 
of  taxes,  i.e.,  stabilization  of  economy,  redistribution 
of  wealth,  and  reallocation  of  resources,  businessmen 
look  at  them  as  a  compulsory  pa^Tiient  to  government.   Indeed, 
they  reduce  the  firm's  profit  by  a  large  percentage,  called 
tax  rate.   The  legal  tax  rate  varies  over  a  period  of  time. 
Whenever  the  rate  is  lower,  the  income  remaining  for  share- 
holders is  larger,  other  things  held  constant,  and  vice 
versa. 

Interest  rate  (INTRR) .   The  role  of  the  interest 
rate  in  the  performance  of  the  firm  was  discussed  in  the 
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revenue  section.   To  avoid  exclusion  of  such  an  important 
factor  from  among  the  doterminarits  of  expenses,  however, 
it  is  reintroduced  in  this  section  as  well.   In  fact, 
since  debt  usually  constitutes  a  main  part  of  the  capital 
structure  of  m.ost  firms,  the  impact  of  the  interest  rate 
com.ing  through  the  expense  side  seems  to  be  m.ore  important 
for  most  firm;S.   In  any  event,  the  higher  is  the  interest 
rate,  the  larger  is  the  interest  charge  as  part  of  the 
firm's  total  expenses,  and  vice  versa. 

Environmiental  factors  (ENVIRN)  .   Other  factors 
such  as  anti-pollution  expenses,  benevolence  expenses,  and 
the  like  may  be  grouped  under  environmental  factors.   Firms 
manufacturing  defense  products  may  incur  opportunity  costs. 
These  costs,  which  are  equal  to  the  differential  profit 
forgone  by  not  employing  the  resources  in  a  more  productive 
way  should  rightly  be  considered  as  expenses.   In  summary: 
EXTEXP  =  n(TAX,  INTRR,  ENVIRN)  (11) 

Risk 

It  is  now  necessary  to  refer  back  to  equation  (1) 
and  make  an  effort  to  identify  the  components  of  the 
overall  risk  of  the  firm  which  m.ay  play  important  roles 
in  determining  the  probability  of  financial  distress. 
Before  going  into  details,  however,  a  digression  on  de- 
finition and  measures  of  risk  seems  appropriate. 
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Definition 

Risk  has  been  defined  differently  in  various 
sources,  both  in  view  of  context  and  in  regard  to  content. 
In  Webster's  Dictionary,  it  has  been  defined  as  the  pos- 
sibility of  loss,  injury,  disadvantage,  or  destruction. 
Sloan  and  Zurcher  defined  risk  as  the  possibility  of  loss 
from  some  particular  hazard,  as  fire  risk,  war  risk, 
credit  risk. 

In  the  McGraw-Hill  Dictionary,  risk  has  been  de- 
fined as  the  exposure  of  an  investor  to  the  possibility  of 

g 

gain  or  loss  of  money.    Machol  and  Lerner  defined  risk  as 

the  cumulative  probability  of  the  return  falling  belov;  some 

9 
level  of  ruin.    And,  in  the  state-of -nature  approach  to 

evaluating  risk,  it  is  implicitly  defined  as  the  possibility 
that  the  desired,  rather  than  the  expected,  return  will  not 
be  achieved. 

Although  all  of  these  definitions  suit  their  own 
places,  there  is  need  for  some  definition  which  fits  the 
context  under  study.   For  that  purpose,  risk  is  here 
defined  as  the  possibility  that  actual  outcome  of  a 
random  variable  "^  will  deviate  from  that  which  is  expected. 
The  random  variable  can,  of  course,  represent  such  di- 
verse factors  as  cash-flow,  rate  of  return,  expenses, 
bad  debts,  or  even  the  business  activity  level. 
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C.  ommon  yr^  as  ares  of  Ri  s  k 

Having  risk  thus  defir.ed,  then,  a  measure  of  risk 
is  a  means  by  which  the  extent  of  risk  is  ascertained. 
There  is,  however,  no  single  measure  of  risk  which  can  well 
explain  the  actual  risk  associated  with  any  factor  under 
consideration.   Facing  this  difficulty,  the  authors  in  the 
field  of  econouiic  and  business,  and  especially  in  the  por- 
folio  literature,  have  adopted  various  means  as  measures 
of  risk,  each  with  its  own  virtues  and  lin.itations .   So 
far,  the  most  popular  measure  of  risk  has  been  variance 
(or  standard  deviation)  of  a  random,  variable  around  its 
mean. 

Variance  or  standard  deviation  of  a  si ng 1 e 
variable.   Tobin,  in  adopting  the  standard  deviation  as  a 
measure  of  risk,  although  in  a  portfolio  context,  m.ade  the 

following  statement  in  his  celebrated  paper  on  "Liquidity 

T  7 

Preference  as  Behavior  Towards  Risk:" 

The  risk  attached  to  a  portfolio  is  to 
be  measured  by  the  standard  deviation  of  R, 
Oj^.   The  standard  deviation  is  a  measure 
of  the  dispersion  of  possible  returns 
around  the  mean  value  u^^.   a  high  standard 
deviation  means,  speaking  roughly,  high 
probability  of  large  deviations  from  '-^n, 
both  positive  and  negative.   A  low  standard 
deviation  means  low  probability  of  large 
deviations  from  v-,    in  the  extreme  case, 
a  zero  standard  deviation  would  indicate 
certaincy  of  receiving  the  return  ^ ^.       Thus 
a  high-Oj^  portfolio  offers  the  investor  the 
chance  of  large  capital  gains  at  the  price 
of  equivalent  chances  of  large  capital 
losses.   A  low-'j^  portfolio  protects  the 
investor  from  capital  loss,  and  likewise 
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gives  hir^.  little  prospect  of  unusual  gains. 
Althoucjh  it  is  intuitively  clear  that  the 
risk  of  a  portfolio  is  to  be  identified  v;ith 
the  dispersion  of  possible  returns,  the 
standard  deviation  is  neither  the  sole  mea- 
sure of  dispersion  nor  tlie  obviously  most 
relevant  measure. . . . 

The  variance  of  a  random  variable  is  the  expected 

value  of  the  squared  deviations  from  the  mean: 

V(R)  =--       E  (R.  -  R)''  P.  (12) 

i-1   ^         ^ 

where  V{R)  is  the  variance  of  the  random  variable  R,  n  is 
the  number  of  possible  outcomes,  R.  is  the  i-th  possible 
outcome,  R  is  the  expected  value  of  all  possible  outcomes, 
and  P.  is  the  probability  or  the  relative  frequency, 
whichever  is  appropriate,  of  occurrence  of  the  i-th  event. 

The  standard  deviation  is  simply  the  square  root 
of  the  variance: 


0^   =    \JV(R)  (13) 

The  variance  obtained  by  equation  (12)  is  in  an 
absolute  term.   For  comparing  the  variability  of  different 
distributions,  however,  there  is  need  for  a  measure  of 
relative  dispersion.   This  need  is  met  by  forming  the 
coefficient  of  variation  as  expressed  below: 

CV(R)  =  V(R)  /  R  (14) 

Semi-variance.   Markowitz,  who  pioneered  the  modern 
theory  on  portfolio  selection,  considered  the  standard 


deviation,  the  semi-variance,  the  expected  value  of  loss, 

the  expected  absolute  deviation,  the  probability  of  loss, 

13 

and  the  maximum  loss  as  candidate  measures  of  risk. 

Among  these  measures,  however,  he  concentrated  more  on  the 

14 
first  two  measures.     One  of  the  reasons  for  considering 

semi-variance  as  a  candidate  m.easure  of  risk  is  its  ability 

to  measure  the  skewness  of  the  probability  distribution  of 

the  random  variable,  say,  return. 

Unlike  the  variance,  which  ignores  the  shape  of  the 
probability  distribution,  semi-variance  takes  into  account 
the  downside  or  upside  tendency  of  the  probability  distri- 
bution.  Since  most  economic  units  associate  risk  with  the 
possibility  of  loss,  downside  fluctuations  in  return,  and 
not  dispersion  per  se ,  have  considerable  importance.   A 
distribution  skewed  to  the  left,  as  indicated  by  semi- 
variance,  would  involve  more  chance  lor  low  or  negative 
returns  than  a  distribution  skewed  to  the  right,  all  other 
things  being  the  same. 

Semi-variance  can  be  expressed  in  the  same  way  as 
variance  was  stated  in  equation  (12)  with  the  exception 
that  n  and  i  now  represent,  respectively,  only  total  num- 
ber of  those  outcomes  and  the  i-th  outcome  whose  values 
are  smaller  (or  greater,  according  to  one's  purpose)  than 
R,  the  overall  m.ean  value: 

SV(R)  =  I        [R.     -    R)^    P.  (15) 

i-1    ^         ^ 


Markowitz  suggested  the  ratio  of  V(R)/2SV(R)  as  a 
measure  of  relative  sko\vness.     For  synL-netric  distributions 
this  ratio  equals  one,  but  it  is  greater  or  less  than  one  if 
distribution  skewed  to  the  right  or  left,  respectively. 

Third  moment.   The  measure  of  skewness  is  not  limi- 
ted, of  course,  to  semi-variance.   Skewness,  in  absolute  or 
relative  sense,  may  be  measured  also  by  the  third  mom.ent  of 
the  probability  distribution  of  the  random  variable.   The 
third  moment  about  the  mean  may  be  expressed  by: 

""  -   3 

m  =      Z       (R.  -  R)   P.  (16) 

which  is  similar  to  equation  (12),  with  the  exception  that 
the  deviations  from  mean,  the  term  (R.  -  R) ,  is  now  cubed 
rather  than  squared.   Cubing  the  term  does  not  change  the 
sign  of  the  deviations,  but  it  magnifies  the  extreme  devia- 
tions, thus  making  the  risk  measurement  possible.   Against 
this  advantage,  the  third  moment  has  the  same  disadvantage 
that  other  moments  have,  i.e.,  it  is  greatly  affected  by 
a  few  extreme  deviations.   In  fact,  since  the  absolute 
amount  of  skewness  depends  on  the  dispersion  of  data,  a 
given  value  of  m^  could  be  the  result  of  slight  asymimetry 
together  with  large  dispersion,  or  great  asymm.etry  with 
small  dispersion.   Its  usefulness  cannot  be  doubted,  how- 
ever, especially  when  it  is  combined  with  other  moments. 
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Arditti,  in  an  empirical  study,  "Risk  and  the 

Required  Return  on  Equity,"    employed  second  and  third 

IS 
moments  along  with  some  other  risk  factors,    and  concluded, 

among  other  things,  that  the  market  required  return  had 
direct  relation  with  third  moment,  as  was  expected  from  a 
measure  of  risk. 

Third  moment  may  become  more  useful  in  comparing 
among  several  distributions  if  it  is  expressed  in  a  rela- 
tive term.   This  is  done  by  dividing  the  moment  by  the  cube 
of  the  standard  deviation  of  the  random  variable  and  is 
called  standardized  third  moment: 

^3  "  ™3/''r  ^■^''^ 

Rate  of  growth.   It  was  discussed  in  previous  para- 
graphs that:  third  moment  about  the  mean  is  greatly  affected 
by  a  few  extreme  deviations.   In  fact,  the  sign  of  the 
moment  may  be  largely  attributable  to  a  single  large  nega- 
tive (or  positive,  as  appropriate)  deviation.   Further- 
more, what  a  measure  of  skewness  shows  is  the  existence  of 
asymmetry  in  the  frequency  distribution.   It  can  not  iden- 
tify hovv  this  asymmetry  occurred,  i.e.,  whether  it  was  due 
to  regular  or  irregular  changes  in  the  values  of  the  random 
variable.   It  is  obvious  that  a  steady  trend  in  one  direc- 
tion removes  most  of  the  uncertainty  (or  risk)  associated 
with  predicting  a  certain  value  for  the  variable,  whereas 
irregular  fluctuations  make  such  prediction  very  difficult. 


In  order  to  del:ect  the  regular  clianges  in  one  direc- 
tion, the  concept  of  rate  of  growth,  whether  rositive  or 
negative,  can  be  utilized.   There  are  two  ways  to  get  the 
rate  of  growth:   the  first  one  is  through  applying  a  geo- 
metric mean  m.ethod.   The  second  is  through  utilizing  the 
regression  analysis  concept.   The  latter  method,  v.'hich  is 
discussed  below,  seems  to  have  m>ore  advantage  than  the 

former.   This  is  due  to  its  ability  to  provide  some  addi- 

2 

tional  information,  such  as  R  ,  the  coefficient  of  deter- 
mination of  the  regression  line. 

The  model  which  is  discussed  here  has  originated 

from  that  which  is  used  for  computing  the  continuously 

19 

compounded  rate  of  interest.     It  is  expressed  as: 

V^  -  Ae^^  (18) 

Where  V   is  the  value  of  the  variable  after  t  years,  A  is 
the  initial  value,  e  =  2.71828,  r  is  the  instantaneous  rate 
of  growth,  and  t  is  total  years  under  consideration. 

To  calculate  r,  first  take  the  natural  log  from 
both  sides  of  equation  (18) : 

In  V   =  In  A  +  rt  (19) 

Then,  by  utilizing  equation  (19),  regress  In  V 

2 
against  z,    and  obtain  r  as  well  as  In  A  and  R  ,  the  coef- 

2 
ficient  of  determination,  of  the  regression  line.   R  , 

which  is  bounded  between  zero  and  one,  is  a  useful  measure 
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of  the  goocinoss  of  fit  of  the  regression  line.   The  closer 

2  20 

IS  R   to  one,  the  better  is  tne  fit. 

2 

In  analyzing  risk,  it  scGms  more  useful  to  use  R 

along  with  r,  the  rate  of  growth,  and  both  R"  and  r  in  con- 
junction with  other  measures  of  risk  such  as  variance  and 
semi-variance,  because  each  of  these  measures  captures  only 
a  portion  of  total  risk  associated  with  the  variation  of  the 
random  variable. 

Confidence  level.   Bauraol,  who  was  also  concerned 

about  the  downside  fluctuations  of  a  random  variable,  such 

21 

as  return,  proposed  still  another  measure  of  risk.     His 

measure  of  risk  resembles  a  lower  bound  or  confidence 
limit  on  the  expected  outcome  and  is  expressed  by: 

L  =  R  -  koj^  (20) 

where  L  is  the  suggested  measure  of  risk,  R  is  the  mean  of 
the  random  variable,  k  is  a  positive  parameter  that  re- 
presents the  number  of  standard  deviations  below  the  mean 
value,  and  a^   is  the  standard  deviation.   Unlike  the 
previous  measures  of  risk,  however,  this  one  is  not  in- 
dependent of  the  evaluator's  preferences.   That  is, 
before  any  value  can  be  found  for  L,  a  value  should  be 
assigned  by  the  evaluator  to  k.   The  dependence  of  this 
measure  of  risk  on  the  evaluator's  utility  function  detracts 
from  its  usefulness  for  the  purpose,  here. 


Covari anco .   One  of  the  useful  measures  cf  risk  is 
covariance  betv.'cen  two  random  variables,  say,  the  firm's 
net  income  and  the  changes  in  business  activity.   It  is  a 
measure  of  the  extent  to  which  the  value  of  the  two  vari- 
ables tend  to  move  up  and  down  together.   It  can  be  ex- 
pressed by  the  following  equation: 

n         _        _ 
COV(R,Q)  =1:   (R   -  R)  (Q   -  Q)  P  f21) 

i=l    1        1        1 

where  R  and  Q  are  two  random  variables  with  expected  values 

of  R  and  Q,  respectively,  R.  and  Q.  are  the  i-th  possible 

outcomes  of  R  and  Q,  n  is  the  total  possible  outcomes  of 

each  variable,  and  P.  is  the  joint  probability  that  R.  and 

Q.  will  occur  sim.ultaneously .   As  is  seen  from  equation 

(21),  when  both  R.  and  Q.  are  above  or  below  their  related 
11 

means  for  most  of  the  times,  the  covariance  becom.es  posi- 
tive, v/hereas,  when  one  of  them  is  above  its  mean  more 
frequently  while  the  other  is  below  its  mean  for  most  of 
the  times,  the  covariance  becomes  negative. 

Correlation  coefficient.   Although  the  covariance 
formula  is  a  good  mean  to  explain  the  relative  movements 
of  the  random  variables,  its  absolute  value  is  difficult 
to  interpret.   A  bettor  measure  which  is  directly  related 
to  covariance  and  can  be  easily  interpreted  is  correlation 
coefficient.   It  may  be  expressed  by: 

COR(R,Q)  =  CO^(R^  (22) 

"R  •  ^^Q     . 
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If  the  two  random  variables  move  up  and  down  in 
perfect  unison,  the  correlation  coefficient  is  one  (they 
are  perfectly  correlated) .   A  correlation  coefficient  of 
zero,  on  the  other  hand,  indicates  that  their  movements 
are  completely  independent  from  each  other  (they  are  uncor- 
related) .   If  the  two  variables  move  in  reverse  direction 
in  perfect  unison,  the  correlation  coefficient  is  minus 
one  (-1).   Any  other  imperfect  correlation  lies  somewhere 
between  +1  and  -1. 

Variance  of  a  group  of  variables.   In  the  port- 
folio context,  where  the  return  on  portfolio  is  dependent 
upon  the  returns  on  the  constituent  securities,  it  has 
long  been  known  that  the  variance  of  the  portfolio  return 
is  a  function  of  the  variances  of  the  comprising  securities 
as  well  as  the  correlation  coefficients  between  every  pair 
of  these  securities.   More  exactly: 


n    n 
V(P)  =   E     Z   X.  X.  COR(i,j)  0.  a.         (23) 
i=l   j=l   ^   " 


1      J  '^13 


where  n  is  total  number  of  securities  in  the  portfolio,  X. 

and  X..  are  the  percentages  of  funds  invested  in  securities 

i  and  j,  COR(i,j)  is  the  correlation  coefficient  between 

these  securities,  and  a-   and  o .  are  standard  deviations  of 

1      3 

securities  i  and  j . 

The  concept  may  bo  extended  to  contexts  other  than 
portfolio,  where  determ.ination  of  a  variation  of  a  single 
random  variable,  such  as  operating  revenue  of  a  firm,  in 
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terms  of  v.iriations  of  several  other  randon  variables,  for 
example,  revenue  from  each  line  of  product,  is  also 
desirable.   All  elements  of  equation  (23)  can  then  be 
easily  redefined  to  fit  such  cases,  except  for  X^  and  X. 
which  should  be  interpreted  in  a  slightly  different  way. 
In  retrospect,  when  past  series  data  are  used,  one  may 
interpret  X.  and  X.  as  the  shares  of  random  variables  i 
and  j,  respectively,  in  the  overall  picture.   For  example, 
if  talking  about  the  variation  in  the  company's  revenue, 
one  may  interpret  X.  and  X.  as  average  proportions  of  the 
total  revenue  which  are  contributed  by  product  lines  i  and 
j,  respectively. 

Computation  of  the  collective  variance  by  equation 
(23)  is  easy  when  the  number  of  constituent  random  vari- 
ables is  small.   For  a  large  number  of  such  variables  the 

computation  in  this  way  becomes  cumbersome.   The  diagonal 

2  2 
model  presented  by  Sharpe    has  solved  this  problem.   In 

the  portfolio  context,  the  following  steps  should  be  taken 

to  calculate  the  variance  of  portfolio: 

First,  a  suitable  rr;arket  index  such  as  GNP  is 

selected  and  then  by  using  the  following  linear  equation 

for  each  security,  individually,  various  returns  of  each 

security  are  regressed  against  the  corresponding  value 

of  the  index: 

R.  .  =  A.  +  B.I  .  +  e.  (24) 

ID     1     ^    J  1 


where  R.  .  is  the  i-th  observation  cf  the  randcrr  variable 

i,  I.  is  the  i-th  observation  of  tiie  index,  A  ,  E.  and  e. 
J  '   1 '   1       1 

are  y-intercept,  slope,  and  random  error  related  to  the 
regression  lino  of  thio  variable  i,  respectively. 

Then,  by  using  the  results  obtained  fron  this 
step  in  the  following  equation,  variance  of  the  portfolio 
is  calculated: 


V(P)  =   ?  xl    al    +  Xl^^ol^^  (25) 

1  =  1 


where  n  is  the  total  number  of  securities  in  the  portfolio, 

X.  is  the  percentage  of  funds  invested  in  security  i,  a. 

is  the  standard  deviation  of  e.  in  (24),  a   ,  is  the  stan- 

1  n+1 

dard  deviation  of  the  index,  and  X  ,,  is  a  "pseudo"  secu- 

'       n+1       ^ 

rity  defined  as 

n 

X  ^,  =-   Z    X.  3.  (26) 

n+1    .,11 
1  =  1 

The  expected  value  of  the  portfolio,  which  is 
needed  for  computing  the  coefficient  of  variation,  is  cal- 
culated by  the  following  equation: 

E(P)  =  J^   X.A.  +  X^^^A^^^  (27) 

where  A  ^,  is  the  expected  value  of  the  index.   The  co- 
n+i. 

efficient  of  variation  is: 


CV(P)  =  V(P)/r  (?)  (28) 

Again,  these  formulations  may  be  applicable  to 
contexts  other  than  portfolio,  as  will  be  shown  later. 

The  probability  of  disaster  level.   In  the  economic 
world  v/here  the  outcomes  of  most  activities  are  subject  to 
uncertainty  there  is  always  a  range  of  outcom.es  for  each 
activity  over  which  the  econom.ic  unit  is  drastically  af- 
fected.  The  upper  bound  of  the  range  where  the  vulner- 
ability to  the  chance  event  just  begins  may  be  referred  to 
as  disaster  level. 

Examples  of  disaster  level  can  be  found  abundantly 
in  the  firm's  various  facets.   If  the  firm's  net  cash-flow 
drops  below  a  certain  point,  for  example,  the  firm  may 
face  a  hard  time.   When  the  sales  shrink  to  a  specific 
level,  it  may  experience  large  losses.   Or,  if  the  total 
asset  value  declines  to  a  level  which  is  less  than  the 
total  liabilities,  the  firm  becomes  insolvent. 

If  a  means  can  bo  found  to  show  the  chance  of  a 
random  variable  being  at  the  disaster  level  or  below, 

then  it  can  be  used  as  an  appropriate  measure  of  risk  for 

23 
dealing  with  such  cases.   Fortunately,  following  Roy, 

one  can  apply  Tchebychef f ' s  inequality  in  obtaining  such 
probability.   This,  of  course,  assumes  possession  of  in- 
formation about  the  moan  and  variance  as  well  as  the 
disaster  level  of  the  random  variable.   The  Tchebvcheff 
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inequality  raay  be  expressed  as  the  following  in  tiiis 
situation: 


P(|R  -  Ej  >  E  -  d)  .^  a^/(E  -  d)^  (29) 


or, 


P(R  <  d)  .<  a^/(E  -  d)^  (30) 


vhere  P  is  the  probability  operand  and  R  is  a  random  vari- 
able whose  expected  value,  variance,  and  disaster  level 

2 
are,  respectively,  E,  a      and  d. 

One  of  the  advantages  of  the  Tchebycheff  inequality 
is  that  it  does  not  require  the  probability  distribution  of 
the  random  variable  to  be  normal.   There  is,  however,  a 
difficulty  in  applying  equation  (30)  in  the  context  used 
here.   This  difficulty  is  due  to  the  unavailability  of 
disaster  level  (d)  in  any  of  the  related  cases.   For- 
tunately, however,  a  designated  d  need  not  be  very  exact, 
because  firms  usually  are  able  to  manage  their  affairs  over  a 
range  of  each  variable.   And  this  range  may  not  be  rela- 
tively too  narrow. 

A  sufficiently  close  approximation  of  d  may  be 
obtained  by  averaging  the  variable  over  all  available 
firms  of  similar  size  and  nature  which  have  actually  ex- 
perienced the  financial  distress.   The  implied  assumption 
here  is,  of  course,  the  dependence  of  the  financial  dis- 
tress upon  the  disaster  level  of  the  variable. 
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If  data  iinavailability  precludes  grouping  the 
firms  of  similar  size  and  nature  for  this  purpose,  one  may 
resort  to  some  appropriate  ratio  of  the  variable.   The 
ratio  could  first  be  averaged  over  all  available  firms 
v;hich  have  experienced  financial  distress  and  the  average 
could  then  be  transformed  to  an  absolute  value  for  each 
firm  by  a  simple  calculation.   As  an  example,  suppose  that 
the  average  disaster  ratio  of  sales  to  assets  turns  out  to 
be  0.5,  then  the  disaster  level  of  sales  for  a  firm  having 
a  total  assets  of  one  million  dollars  can  be  obtained  by 
multiplying  one  million  by  0.5  which  equals  $500,000. 

Risk  factors 


Risk  associated  with  cash-flov;  (RSKCF)  .   The  im- 
portance of  sufficient  cash  inflow  in  meeting  the  firm's 
obligations  cannot  be  over-stated.   Since  cash-flow  is  a 
random  variable,  at  any  point  of  time,  it  may  take  some 
value  between  tv;o  possible  extreme  points  relevant  to  the 
firm.   But,  at  some  level  of  cash  inflow  which  is  of 
course  different  for  each  firm,  the  firm  may  not  be  able 
to  run  the  business  smoothly.   The  possibility  that  the 
cash  inflow  falls  below  this  disaster  level,  then, 
constitutes  a  risk  factor  v/hich  may  play  an  important 
role  in  determining  the  possible  financial  distress  of 
the  firm. 

If  it  is  possible  to  estimate  such  a  disaster 
level  for  each  firm,  it  may  then  be  possible  to  estimate 


ef. 


the  probibility  of  its  occurrence  by  rrsorting  to  the 
Tchebychoff s  inequality,  as  reproduced  here. 


P(ACF  <  d)  <  o^^^/iACF    -  d)^  (31) 


where  P  is  the  probability  operand,  ACF  is  the  available 
cash--flcw  variable  whose  mean  value  and  variance  are  ACF 
and  >-"'ap-'  respectively,  and  d  is  the  disaster  level  for 
available  cash-flow. 

To  find  the  disaster  level  of  each  firm  it  is 
highly  desirable  to  have  access  to  data  of  as  many  failed 
firms  as  possible  v/hich  have  the  same  characteristics, 
particularly  with  respect  to  size  and  business  nature,  as 
those  of  the  firm  under  consideration.   But,  since  finding 
such  firms  is  not  always  possible,  a  reasonable  approxima- 
tion may  hp   oV->tained  by  first  averaging  the  ratio  of  ACF 
to  total  assets  of  all  failed  firms  in  the  same  industry, 
and  then  transforming  this  average  ratio  to  an  absolute 
value  through  its  m.ultiplication  by  total  assets  of  the 
firm. 

To  find  the  probability  that  the  cash-flow  of  the 
firm  may  fall  below  this  calculated  disaster  level,  it  is 
necessary  to  calculate  the  mean  value  and  variance  of 
cash-flows  of  the  firm  over  a  recent  period  of  time  to  be 
used,  along  with  the  calculated  disaster  level,  in  the 
Tchebychef f ' s  inequality.   The  probability  obtained  in  this 
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way  :.s  a  conservative  one,  yet  it  may  be  a  useful  approxi- 
mation to  risk  assumed  by  the  firm  with  respect  to  its 
cash-flow  procedviro. 

Donaldson  has  proposed  that  the  m.ain  concern  of  a 
company  is  to  see  whether  the  cash  balances  fall  below 

zero  or  not,  and  to  detormiine  such  possibility,  he  suggests 

24 

examining  the  cash-flov;s  under  recession  condition. 

If  that  is  the  case,  ACF  in  equation  (31)  may  be 
redefined  to  be  net  cash-flow.   Then,  the  disaster  level, 
d,  changes  to  zero,  and  the  probability  that  the  net  cash- 
flov;  falls  below  zero  becomes  equal  to  or  less  than  the 
ratio  of  the  coefficient  of  variation  to  the  m.ean  value  of 
the  net  cash-flow: 

P(ACF  <:  0)  $  CV(ACF)/SCF  (32) 

The  preference  of  (31)  to  (3?) ,  or  the  other  way 
around,  as  a  determinant  of  the  firm's  risk  exposure 
depends,  of  course,  on  the  results  of  empirical  checks. 

Risk  associated  with  revenue  (RSKREV) .   Risk 
associated  with  revenue  is  the  possibility  that  actual 
revenue  will  deviate  from  that  which  is  expected. 

Following  the  discussion  above  of  the  measure  of 
risk  section  with  respect  to  the  properties  of  the  variance 
of  a  random  variable,  one  m.easures  the  risk  associated  with 
total  revenue  by  finding  the  variance  of  the  values  taken 
by  revenue  variable  over  the  period  under  consiclcration . 
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Equation  (33)  may  be  used  for  such  purpose: 

V(REV)  =  -    .-:   (REV.  -  RLV)^  (33) 

n   .^.^ 

where  V(REV)  is  variance  of  revenue,  n  is  total  number  of 
observations  on  revenues,  RLV .  is  the  i-th  observation,  and 
REV  is  the  mean  value  of  total  observations  on  revenue. 
This  equation  provides  an  absolute  value  for  the  vari- 
ability of  revenue.   Its  relative  value,  however,  which 
is  obtained  by  coefficient  of  variation,  may  be  more  useful 
in  discriminant  analysis: 

CV(REV)  =  V (REV) /REV  (34) 

As  was  discussed  in  previous  section,  however,  the 
variance  alone  does  not  capture  all  the  risk  associated 
with  revenue.   If  the  frequency  distribution  of  revenue  is 
skewed  to  the  left,  there  is  higher  possibility  of  loss, 
and  as  a  consequence,  higher  possibility  of  financial 
distress.   It  may,  therefore,  be  helpful  to  supplem.ent  the 
coefficient  of  variation  v;ith  a  m.easure  of  relative  skew- 
ness.   For  this  purpose  it  is  possible  to  use  either  semi- 
variance  or  third  moment  of  revenue.   But  to  choose  betv;een 
them  by  detecting  the  discriminatory  power  of  each  of  them, 
it  seem.s  appropriate  to  use  both  in  this  analysis.   The 
semi-variance  of  revenue  may  be  expressed  by: 

1    n  2 

SV(REV)  =  -         T.        (REV.  -  REV)  (3  5) 

i  =  l 
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v/here  n  represents  the  number  of  values  below  the  nean, 

REV,    and  RPJV .  is  the  i-th  observation  of  such  values. 
1 

The  relative  skev/ness,  using  seirii-variance ,  is 
thus  obtained  by  the  following  ratio: 

CSV(REV)  =  V(x^.EV)/2SV(REV)  (36) 

The  third  moment  about  the  mean  of  the  frequency 
distribution  of  revenue  may  be  expressed  by: 

1    "  3 

m^  =  -    Z    (REV.  -  REV)  (37) 

3    n   .  ,       1 
1=1 

where  right-hand  symbols  are  the  same  as  those  in  equation 

(33)  . 

The  standardized  third  moment  which  indicates  the 

relative  skewness  of  revenue  frequency  distribution  is 

then  obtained  by: 

3 
a^  =  m^  /  V(REV)^  (38) 

For  the  reasons  explained  in  the  measures  of  risk 
section,  it  may  be  useful  to  consider  the  rate  of  growth 
of  revenue  in  conjunction  with  the  above-mentioned  m^easures 
of  revenue  dispersion  and  skewness.   It  is  necessary  to  re- 
define the  variables  in  equation  (19),  first: 


In  V   -  In  A  +  rt  (39) 

where  A  is  the  initial  total  revenue,  V   is  total  revenue 
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after  t  years,  r  is  the  rate  of  growth,  and  t  is  the  t-th 
year  of  T,  the  total  years  under  consideration. 

Then. a  regression  line  is  fitted  to  the  points 
obtained  by  plotting  the  natural  log  of  total  revenue  of 
each  year  (V  )  against  the  tin-.e  (t)  .   The  rate  of  growth 
and  R   obtained  from  this  lino  is  then  used  in  the  analysis. 

Risk  associated  with  product  lines  (RSKPL) .   The 
risk  associated  with  revenue  changes  with  the  number  of 
product  lines  that  the  firm  offers.   The  reason  is  based  on 
the  same  principle  applied  to  diversification  in  investment. 
Indeed,  having  several  product  lines  may  be  necessary  but 
it  is  certainly  not  sufficient  to  reduce  risk.   If  all  the 
product  lines  are  perfectly  positively  correlated,  with 
respect  to  revenue,  with  each  other,  then  there  accrues  no 
benefit  from  such  diversification.   Their  collective  per- 
formance resembles  that  of  any  constituent  individual  line, 
except  on  a  larger  scale. 

Negative  correlations  between  revenues  generated  by 
each  product  line,  on  the  other  hand,  reduce  the  risk  as- 
sociated with  revenue.   The  higher  is  the  degree  of  diver- 
sification of  product  lines  witli  such  properties,  the  lower 
is  the  related  risk.   The  recent  great  wave  of  conflomera- 
tion''   seems  to  be  partially,  at  least,  in  pursuit  of  such 
concept . 

Existence  of  some  negative  correlations,  however, 
does  not  guarantee  that  the  collective  risk  is  changed 
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signi.f  icantly .   If  the  analysis  of  the  historical  data 
indicates  that  the  product  line  with  negative  correlation 
contributes  only  a  small  percentage  to  the  total  revenue, 
its  risk-reducing  effect  would  be  minor,  and  vice  versa. 

All  the  above  points  are  accounted  for  if,  follow- 
ing the  portfolio  concept,  the  variance  of  total  revenue  is 
expressed  by  the  following  function: 

n     n 
V(REV)  =   E     Z    X.  X.  COR(i,j)a.  a.         (40) 
1=1   j  =  l    ^   -^  "-   J 

where  n  is  the  number  of  product  lines,  X. ,  X.  are  the 
historical  average  proportions  of  total  revenue  which  come 
from  lines  i  and  j,  respectively,  COR(i,j)  is  the  correla- 
tion between  revenues  from  lines  i  and  j,  and  a.,  a.    are 
standard  deviations  of  revenues  from  lines  i  and  j , 
respectively. 

Under  a  certain  condition,  it  may  be  possible  to 
calculate  the  risk  associated  with  product  lines.   This 
condition  is  that  the  firm  should  either  have  only  one 
store,  or  else,  if  it  is  a  multi-store  firm,  all  the  stores 
should  be,  revenue-wise,  perfectly  positively  correlated 
to  each  other.   If  this  condition  is  not  met,  however, 
the  calculated  risk  represents  the  combined  risks  of  both 
product  lines  and  stores. 

To  find  the  risk  associated  with  product  lines, 
assume  first  that  all  the  lines  are  perfectly  positively 
correlated  with  each  other;  then  all  correlation  terms  in 
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equation  (40)  would,  by  definition,  be  equal  to  one.   This 
constitutes  the  upper  limit  of  the  variance  of  total  reve- 
nue.  On  the  other  hand,  if  all  the  product  lines  were 
perfectly  negatively  correlated  to  each  other,  all  correla- 
tion terms  in  equation  (40)  would,  by  definition,  equal 
minus  one.   This  constitutes  the  lower  liiriit  of  the  vari- 
ance of  total  revenue.   For  any  imperfect  correlation, 
either  positive  or  negative,  between  product  lines,  each 
correlation  term  in  equation  (40)  assumes  a  value  between 
-1  and  +1.   The  weighted  average  of  these  correlations 
then  determines  the  position  of  the  variance  of  total 
revenue  between  its  upper  and  lower  limits. 

Bearing  in  mind  the  above  discussion,  one  may 
express  the  upper  limit  of  the  variance  of  total  revenue 
by  the  following  equation: 

n     n 
V(REV)  -   Z     Z    X.  X.  a.  j  .  (41) 

i=l   j^l    ^   3   1   D 

The  case  for  any  position  between  upper  limit  and 
lower  limit  could,  of  course,  be  expressed  by  equation 
(40)  itself.   The  deduction  from  variance  of  total  revenue 
which  resulted  from  imperfectly  correlated  product  lines, 
or  equivalently ,  the  risk  (although  negative)  associated 
with  product  lines,  is  then  equal  to  the  negative  of  the 
difference  between  equations  (40)  and  (41). 
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n   n  n    n 

RSKPL  =  -IT.        y.     X.  X.  J.  o.  -I    :■  X,  X^  COR(i,j)  J.  a.] 
1-1  j=l  ^  '   ^  ^  1=1  D=l  "  ^  ■' 

The  first  term  in  the  bracket  can  be  rewritten  as: 

n        ^     n     n 

r   X    J   +  L  Z      X.  X.a.o.,  for  all  i^^j 

i  =  l   ^    ^    i  =  l   j  =  l   ^   ^   ^   3 

The  second  term  in  the  bracket  can  be  rewritten  as: 

n    _  -,    n    n 

I   X  a  +  2    1  X.  X.  COR(i,j)  a-    a.,  for  all  ij^j 
i=l   ^  "   i-1  j=l  ^  ^  -^ 

Replacing  the  terms  in  brackets  with  their  equivalents, 

"22 
omitting   Z   X.  •  o.,  and  factoring  common  factors,  there 

i=l      ^  ^ 

a  s   obtained 

n         n 
RSI<?L  =  -     E         I     X.   X.   c.    a.    [1  -  COR(i,j)]    for  all  if^j 
i=l     j=l     ^     ^     ^     ^ 

(42) 

The  negative  sign  in  front  of  the  r.  notation 
indicates  that  multiple  product  lines  have  a  risk-reducing 
effect,  but  to  compare  the  effects  of  various  product  lines 
one  should  compare  their  negative  effects  on  reducing  risk 
associated  v?ith  total  revenue.   The  larger  is  the  absolute 
value  of  RSKPL,  the  larger  is  the  deduction  from  the  vari- 
ance of  total  revenue  as  a  result  of  such  diversification, 
and  vice  versa.   Therefore,  the  larger  is  the  absolute 
value  of  RSKPL,  the  more  desirable  is  the  product  line,  and 
in  the  discussed  risk  context,  the  smaller  is  the  risk 
associated  with  it.   Conversely,  the  smaller  is  the  absolute 


value  of  RSKPL,  the  largr^r  is  the  risk  associated  with 
product  line.   The  lower  bound  of  the  absolute  value  of 
RSKPL  is  zero,  and  it  occurs  just  when  all  the  product 
lines  are  perfectly  positively  correlated,   i.e.,  correla- 
tions between  all  product  lines  are  equal  to  one  and  thus 
all  [l-COR(i,j)]  terms  in  equation  (42)  become  zero.   This, 
of  course,  means  that  no  benefit  is  accrued  from  such  di- 
versification. 

The  risk  expressed  in  equation  (42)  is  in  an  absolute 
term.   In  order  to  make  it  in  a  relative  term,  and  bring 
it  into  a  line  with  the  coefficient  of  variation,  divide 
both  sides  of  (42)  by  the  mean  value  of  total  revenue. 
Thus,  denoting  the  relative  risk  associated  with  product 
line  by  RRSKPL,  there  is  obtained: 

_    n    n 
-J—      I         E  X.  X.  o.  a.  [1  -  COR(i,j)],  for 


REV  i=l   j=l  ^   J   ■"   ^ 


all  i  7^  j  (43) 


If  the  product  lines  are  many  and  calculation  of 
the  covariance  terms  is  cumbersome,  it  may  be  helpful  to 
apply  the  indexing  method  in  computing  the  risk  associated 
with  product  lines.   In  that  case,  GNP  or  another  suitable 
market  index  is  selected  and  variables  in  equations  (24) 
through  (27),  which  are  reproduced  here,  are  redefined  to 
fit  the  revenue  context.   Thus: 

REV. .  =  A.  +  B.I .  +  e.  (44) 

ij     1     ID     1 
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Where  REV. .  is  the  j-th  observation  of  the  revenue  gener- 
ated by  product  line  i,  I.  is  the  j-th  observation  of  the 
index,  and  A.,  B . ,  and  e^  are  y-intorcept,  slope,  and  random 
error  related  to  the  regression  line  of  the  product  line 
i,  respectively. 

n 


v2   2  ,  ,.? 
i-1 

2  . 


V(REV)  =  >   X^  o^  ^  x;;^^  0^^,  (45) 


where  V{REV)  is  variance  of  total  revenue^ g^  is  the  vari- 
ance of  e.,  o^_^-L  is  the  variance  of  the  index,  and  X^_^^  is 
"pseudo"  product  line  defined  as 


X  ^-,  =   ^   X,  B.  (46) 

n+1    .^^   X   1 


And  the  mean  value  of  total  revenue  is  obtained  by 


n 


REV  =   E   X,  A,  +  X_,  A^^,  (47) 

1=1 


"1     n+1   n+1 


Where  A   ,  is  the  expected  value  of  the  index. 
n+1 

Now,  if  it  is  assumed  that  all  the  product  lines 

are  prefectly  positively  correlated  to  each  other,  then  B^ = 

R   =     =  B   =  B*,  where  B*  is  the  slope  of  the  regression 

2    *  '  *     n 
line  fitted  to  past  series  data  on  total  revenue  and  the 

index  values.   In  that  case: 


X 


=   V   X.B.  =  B*  Z       X.  -  B*  (48) 


n+1    .I^L   ^  ^       i=l   "■ 


Replacing  (48)  in  (45),  gives  the  risk  associated  with 
total  revenue  in  the  absence  of  any  advantage  from  diversification 
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"22    -,*2   2  (49) 

i=l  . 

Reduction  in  this  risk  as  a  result  of  combining  product 
lines  which  are  not  perfectly  positively  correlated  to 
each  other  is  obtained  by  calculating  the  difference  be- 
tween equations  (49)  and  (45).   Again,  the  negative  of  this 
difference  is  the  risk  associated  with  product  lines. 

RSKPL  ^  -  [  Z  4aJ  +  B*2  o^^,  "  (.1  ^4   +  ^n+i^n+l^  ^ 
i=l  1=1 


or, 


n 


2         ^p,2  _  (  '^'   ,,^  ,2^  (50) 

n+1         i^i 


RSKPL  ==-a2    [B*^  -  (  ,  I   XiB.)  ] 


To  get  the  relative  risk  associated  with  product 
lines,  divide  both  sides  of  equation  (50)  by  the  raean 
value  of  total  revenue. 


n+1    rT,*2  _  /  V  y  R  ■)^1  (51) 


RRSKPL  =   -   -^^      [B*'  -  (/^/i^i^  1 
REV  1=1 

Risk  associated  with  places  occupj^d_jRSKPO)  . 
Another  factor  which  may  have  risk-reducing  effect  on  the 
firm's  position,  is  the  number  of  places  that  it  occupies. 
The  effect  comes  through  the  same  principle  as  applied  to 
product  lines.   Over  any  period  of  time,  many  factors, 
such  as  changes  in  the  competitive  forces,  changes  in  the 
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buying  population,  and  changes  in  the  purchasing  power  of 
the  inhabitants  of  the  location  v/hore  the  firm  has  its 
business  act  and  react  on  the  firm's  overall  performance. 

Supposedly,  diversification  into  several  places 
deducts  from  the  dependency  of  the  firm's  performance  on  a 
set  of  factors  peculiar  to  a  certain  location.   This  may 
mean  reduction  in  the  business  risk  of  the  firm,  but  this 
reduction  m.ny  not  materialize  if  some  conditions,  similar 
to  those  of  product  lines,  do  not  exist.   To  name  only  a 
few,  these  conditions  include  existence  of  negative  corre- 
lations between  each  pair  of  stores,  as  well  as  the  rela- 
tive importance  of  the  contribution  that  the  stores  with 
negative  correlations  make  to  total  revenue. 

It  seems  also  plausible  to  expect  that  the  reduc- 
tion in  risk  through  multiplication  of  stores  is,  ordi- 
narily, in  addition  to  that  which  is  resulted  from 
diversification  in  product  lines.   Otherwise,  the  mul- 
tiplication of  stores  would  have  a  negative  effect. 

The  provision  for  calculating  risk  associated  with 
number  of  stores  is  similar  to  that  for  product  lines. 
That  is,  the  firm  should  either  have  only  one  product 
line,  or  else,  all  product  lines  should  be  perfectly 
positively  correlated  to  each  other. 

To  calculate  RSKPO,  the  procedure  is  the  same  as 
that  applied  to  the  product  lines  case.   First,  suppose 
that  the  stores  in  different  places  are  perfectly 


positively  correlated  with  oach  other.   Thus 


n    n 
V(REV)  -  I         :.         Y  y.Q.Q  (52) 

i-1  j-=l    1  J  ^  J 


where  V(REV)  is  variance  of  total  revenue,  n  is  the  number 
of  stores,  and  Y.  and  Y.  are  the  historical  average  pro- 
portions of  total  revenue  which  come  frcin  stores  i  and  j, 
respectively,  and  Q.  Q.  are  standard  deviations  of  reve- 
nues from  stores  i  and  j,  respectively. 

Then,  consider  the  case  in  which  the  stores  are 
not,  revenue-wise,  perfectly  positively  correlated  with 
each  other.   In  that  case: 


V{REV)  =21      Y.Y.COR(i, j)O.Q- 
1=1  j=:l   ^  ^  ^    ^ 


(53) 


where  COR(i,j)  is  the  correlation  between  revenues  from 
stores  i  and  j . 

The  deduction  from  variance  of  total  revenue  as  a 
result  of  the  imperfectly  correJated  stores  is  the  differ- 
ence between  equations  (53)  and  (52).   Thus,  the  risk 
associated  with  stores  (although  negative)  is  the  negative 
of  such  difference: 


ESKPO 


[  r   Z  Y. Y.Q.Q.  -  E   L     Y.Y.COR(i,j)Q.Q.]   (54) 
i=l  j=l  ^  ^  ^  ^   i=l  j-1  ^  ^         ^  ^ 


This  expression,  after  some  manipulation  similar  to  that  in 
the  product  lines  case,  can  be  rewritten  as  the  folllowing: 
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n       n 
RSrCPO  =  -      r.        ;;     Y.Y.Q.Q.    [l  -  COR(i,j)l  ,    for  all   i*j         (55) 
i=l  j=l     -  ^   ^^ 

Which  can  be  put  in  a  relative  form  by  dividing  by  the 
mean  value  of  total  revenue: 

^    n   n 
FRSKPO  =  -  -^^  T.        Z     Y.Y.Q.Q.[l-<:OR(i,i)],  for  all  iT^j    (56) 
^^'      i=l  j-1  ^  ^  ^  ^ 

At  this  point,  it  should  be  noted  that  when  the 
firm  has  more  than  one  product  line,  or,  otherwise,  when 
the  product  lines  are  not  perfectly  positively  correlated 
to  each  other,  the  calculated  RSKPO  above  represents  the 
total  risk  associated  with  diversification  in  both  dimen- 
sions, product  lines  and  stores.   For  such  companies,  the 
value  of  this  combined  risk  should  conceivably  equal  that 
obtained  through  product  lines  analysis. 

The  risk  associated  with  places  that  the  firm 
occupies  is  not  limited  to  the  one  calculated  above.   It 
can  be  extended  to  cover  firms  which  operate  internation- 
ally.  Companies  which  are  multi-national  in  nature,  are 
constantly  facing  changes  in  exchange  rate,  changes  in 
foreign   governments'  policies,  and  changes  in  the  eco- 
nomic conditions  of  these  countries.   The  risk  associated 
with  some  of  these  factors  can  be  quantitatively  accounted 
for  in  the  analysis,  but  for  other  factors  qualitative 
measures  may  be  more  applicable. 


Rif;k  ar^sociatoci   with  CK;.hnrt'':e rite  (Rf-KXP)  .   The 

risk  associated  with  exchange  rate  is  suitable  for  treating 
quantitatively.   A  part  of  this  risk  comes  through  the  net 
income  before  taxes  contributed  by  the  firra's  divisions  in 
foreign  countries.   Another  part  comes  through  assets  of 
these  divisions  physically  placed  in  such  countries.   Ac- 
cordingly, we  may  express  this  risk  by  the  following  equa- 
tions : 

n  n 

RSKXRl  -   Z   X.CV(XR).  +  T.       Y.CV(XR).  (57) 

i=l   "^  i-1 

where  n  is  the  nuiaber  of  countries  in  which  the  firm  has  a 
division,  X.  is  the  uroDortion  of  net  income  before  taxes 
which  comes  from  the  division  in  i-th  country,  CV(XR) .  is 
the  coefficient  of  variation  of  the  exchange  rate  in  the 
i-th  country  over  some  period  of  time,  and  Y.  is  the  pro- 
portion of  the  assets  placed  in  the  i-th  country. 

The  first  term  in  equation  (57)  is  a  weighted 
average  of  coefficient  of  variations  of  exchange  rates  of 
the  related  countries,  whose  weights  are  the  x'elative 
importance  of  the  contribution  of  each  division  in  foreign 
countries  to  total  net  income  before  taxes.   The  second 
term  is  another  v.'eighted  average  of  such  coefficients  whose 
weights  are  proportions  of  assets  placed  in  each  country. 
The  more  is  the  company's  dependency  on  the  income  provided 
by  a  foreign  division,  and  the  more  it  has  engaged  its 
assets  there,  the  higher  is  its  risk  with  respect  to  the 


exchange  rate.   From  equation  (57),  it  follov;s  that,  for  a 
firm  vhich  operates  only  domestically,  X^'s  and  Y^'s  are 
equal  to  zero,  and  thus  it  docs  not  bear  any  exchange  rate 
risk . 

It  may  be  more  useful  to  supplement  the  above 
m.easure  v;ith  the  relative  measure  of  skev/ness,  the  stan- 
dardized third  moment  about  the  mean  of  each  of  the  ex- 
change rates.   The  negative  sign  of  the  m.oment  may  indicate 
depreciative  tendency  of  the  changes  in  the  exchange  rate, 
whereas  a  positive  sign  may  signify  appreciative  propensity. 
Since  there  are  no   positive  correlations  between  all  ex- 
change rates  at  all  times,  some  component  parts  of  the  risk 
may  be  cancelled  out,  if  the  weights  a^-e  in  appropriate 
proportions.   To  express  the  above  points  in  another  way: 

n  n 

RSKXR2  =   Z   X.(a  )^  +  T      Y^Ca^)^^  (58) 

i=l   ■*"     ■""    i  =  l 

where  (a-) .  is  the  standardized  third  moment  of  the  exchange 
rate  frequency  distribution  of  the  i-th  country  which  can 
be  obtained  by  applying  equations  (15)  and  (17). 

The  risk  associated  with  exchange  rate  is  then  a 
function  of  equations  (57)  and  (58).   Thus: 

RSKXR  =  e(RSKXRl,  RSKXR2)  (59) 

Risk  associated  with  age  (RSKAGE) .   The  relation- 
ship between  age  and  revenue  of  a  firm  was  discussed 
previously.   This  relationship  can  now  be  extended  to  the 


extreme,  the  firm's  financial  distress.   It  has  been  an 
observable  fact,  as  shown  by  any  listing  of  failed  com- 
panies, that  business  failures  occur  m;cre  frequently  in  a 
certain  age  group.   Conceivably,  this  age  group  differs 
from  industry  to  industry  due  to  the  peculiarities  and  the 
complexities  involved  in  and  the  experience  required  by 
each  industry.   In  general,  however,  younger  firms  are  more 
apt  to  fail  than  older  ones.   But  the  relationship  is  not 
linear.   In  fact,  over  a  certain  range  of  the  earlier  years 
the  function  is  increasing  until  it  reaches  a  peak  point 
and  then  it  diminishes.   Age,  thus,  seems  to  be  a  rela- 
tively important  determinant  in  business  failures. 

The  possibility  that  a  firm  faces  financial  dis- 
tress is,  therefore,  partially  dependent  upon  its  abso- 
lute age  as  well  as  its  relative  age  position  in  a  popula- 
tion of  failed  firms  in  its  industry.   To  find  the  risk 
that  a  firm  bears  by  being  in  a  certain  age  group  is,  then, 
to  find  the  probability  that  the  age  of  the  firm  is  the 
age  of  failure.   To  find  this  probability,  take  advantage 
of  the  Tchebycheff    inequality: 

P(A  <  d)  ^  a^/(E  -  d)^  (60) 

where  A  is  defined  here  as  failure  age  variable  whose  mean 

2 

value  and  variance  are  E  and  a    ,  respectively,  and  d  is 

any  specified  level  of  age  within  the  range  of  A. 


3  3 


RclatJor.  (60)  nay  now  be  interpreted  in  the  follow- 
ing way:   The  probability  that  the  failure  age  is  less  than 
or  equal  to  the  age  of  a  specific  fir.-n  (within  the  range) 
is  less  than  or  equal  to  the  ratio  in  the  right-hand  side 
of  the  inequality,  where  d  takes  the  value  of  the  firm's 
age.   It  is  seen  from  (60)  that  as  the  age  of  the  firm  (d) 
approaches  to  the  mean  value  (E) ,  which  is  the  weighted 
average  of  all  the  failed  firms  in  the  related  industry, 
the  probability  that  the  age  of  the  firm  is  the  age  of 
failure  increases.   In  the  extreme,  where  d  =  E,  this 
probability  would  becom.e  infinitely  large. 

This  relation  does  not  give  the  probability  for  a 
certain  age  alone;  rather  it  gives  a  cumulative  probability 
which  includes  those  of  the  lov.'er  ages.   To  get  the  "net" 
figure,  it  is  necessary  to  deduct  from  the  probability  of 
each  age  obtained  by  (60)  the  probability  of  the  immedi- 
ately lower  age,  as  also  obtained  by  (60).   It  should  be 
noted,  however,  that  all  the  probability  figures  obtained 
by  Tchebychef f ' s  inequality  are  much  too  conservative. 
This  conservative  property  increases  as  d  approaches  E. 
The  probability  figures,  nevertheless,  provide  a  useful 
set  of  weights  to  be  assigned  to  firms  as  risks  associated 
with  their  relative  ages. 

Risk  associated  with  expenses  (RSKEXP) .   Just  like 
the  case  for  any  other  variable  whose  value  may  vary  over 
a  possible  range,  expenses  are  also  treated  as  a  random 


variable.   This  being  so,  there  exists  a  possibility  that 
actual  expenses  differ  from  what  is  expected  at  any  point 
of  time.   This  possibility  is  v;hat  is  meant  by  risk  associa- 
ted with  expenses. 

Ordinarily,  it  is  expected  that  an  increase  in 
expenses  is  concurrent  with  an  increase  in  revenue  and  a 
decrease  in  the  former  occurs  at  the  same  time  as  a  de- 
crease in  the  latter.   But  this  may  not  be  what  actually 
happens;  movements  of  expenses  and  revenue  may  not  coin- 
cide.  Expenses  m.ay  go  up  at  the  same  time  that  revenue 
declines,  or  expenses  may  decline  when  revenue  climbs  up. 
Although  not  very  popular,  and  although  the  latter  case 
is  highly  desirable,  detection  of  the  existence  of  any 
such  relations  may  shed  some  additional  light  on  the  over- 
all risk  exposure  of  the  firm. 

One  way  to  find  out  about  such  disagreement  in  the 
behavior  of  expenses  and  revenue  is  to  consider  the  cor- 
relation between  them.   Correlation  between  expenses  and 
revenue  of  a  firm  can  be  obtained  by  utilizing  equation 
(22)  in  conjunction  with  equation  (21)  .   The  combiined 
formula  to  be  used  as  the  determinant  of  RSKXPl  is  pre- 
sented here: 

-,      n       

COR  (REV,  EXP)  =- I     (REV  -REV)  (EXP^-EXP)?^   (61) 

REV^EXP  i=l 

where  the  left-hand  term  is  the  correlation  coefficient 

between  revenue  and  expenses,  REV.  is  the  i-th  occurring 
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of  revenue  random  variable  whose  mean  and  variance  are  REV 

and  o„^,,,  respectively,  EXP.  is  the  i-th  occurrinq  of  ex- 
REV      '        ■'      1  ^ 

penses  random  variable  whose  mean  and  variance  are  EXP  and 
a    ,  respectively,  n  is  the  total  num.ber  of  occurrences, 
and  P.  is  the  joint  probabilitv  of  the  i-th  occurrence  of 
both  revenue  and  expenses. 

The  correlation  coefficient  takes  any  value  between 
-1  and  +1 .      Logical] y,  there  is  more  risk  attached  to  the 
cases  which  show  negative  correlation  between  expenses  and 
revenue  than  otherwise.   Furthermore,  the  closer  is  the 
negative  value  to  minus  one,  the  riskier  the  firm  appears 
to  be,  other  things  being  equal. 

Existence  of  negative  correlation  between  expenses 
and  revenue  m.ay  be  due  to  several  factors  such  as  the 
relations  between  fixed  costs  and  variable  costs,  the 
technology  that  the  firm  employs,  and  the  lag  between  ex- 
penses and  revenues.   In  some  cases,  further  exploration 
of  such  relations  may  be  rewarding,  but  this  discussion  is 
limited  to  the  seem.ingly  most  significant  ones. 

It  J s  obvious  also  that  the  firm  bears  less  risk 
if  the  major  expense  items,  i.e.,  selling,  general  and 
administrative  expenses  are  not  perfectly  positively  cor- 
related to  each  other.   Following  the  portfolio  concept, 
the  effect  of  such  correlations  on  the  variance  of  overall 
expenses  can  be  found  by  a  treatment  similar  to  that  ap- 
plied to  the  risk  associated  with  product  lines.   Thus: 


RSKXr'2  =  -  ^:^   ):    7.      X.X.>.a.  [l-COR(i,j)]  (62) 
EXP   i=i  j=l   1  3  1  J 

v;here  n  is  the  total  number  of  major  exoense  items,  X.  and 
X.  are  the  proportions  of  total  expenses  attributed  to 
major  ir.ems  i  and  j,  respectively,  a.  and  cf  .  are  standard 
devaitions  of  expenses  in  major  items  i  and  j,  and  COR(i,j) 
is  correlation  between  these  expenses. 

PJquat-ion  (62)  states  that  the  larger  is  the  nega- 
tive correlation  between  the  major  expense  items,  the 
larger  is  the  deduction  from  the  overall  risk  exposure  of 
the  firm,  and  vice  versa. 

In  sum,  risk  associated  with  expenses  is  a  func- 
tion of  the  above  components: 

RSKXP  =  f(RSKXPl,  RSKXP2)  (63) 

Risk  associated  with  labor  (RSPCLBR)  .   Closely 
related  to  the  risk  associated  with  expenses,  yet  indi- 
vidually of  high  importance,  is  the  possibility  that  the 
actual  behavior  of  the  workers  of  company  deviates  from 
what  is  expected.   With  an  ever-increasing  power  of  the 
unions,  firms  are  becoming  more  and  m.ore  vulnerable  to 
actions  taken  collectively  by  workers.   Any  strike,  any 
wage-increase  demand,  or  any  fringe-benefit  request  has 
drastic  effects  on  the  firm's  resources. 

A  firm  whose  nature  or  policy  does  not  warrant 
full  or  partial  application  of  automation,  relies  more  on 
labor.   But  the  higher  is  the  dependency  of  the  firm  on 
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labor,  the  higher  is  the  risk  that  the  firin  boars  in  this 
regard . 

To  deterrr.ine  the  degree  of  the  dependency,  first 
capitalize  the  labor  expenses  by  an  appropriate  discount 
rate.    Then  find  the  ratio  of  the  capitalized  labor 
expenses  to  total  human  and  non-human  assets.   The  higher 
is  this  ratio,  the  higher  is  the  risk.   The  reverse  is 
true,  cf  course,  for  the  lower  ratio. 

Another  determinant  of  the  labor  risk  is  the  com- 
petitive position  of  the  labor.   If  the  laborers  are  union- 
ized, they  are  more  forceful  and  impose  more  risk  on  the 
firm  than  otherwise.   On  the  other  hand,  if  the  firm  has 
a  monopsony  power  in  employing  the  labor,   its  position 
would  have  risk-reducing  effect.   Theoretically,  labor 
and  firms  could  take  any  position  between  perfect  com- 
petition to  monopoly  or  monopsony.   In  practice,  however, 
one  type  is  more  prevalent  and  that  is  the  unionization 
of  labor. 

At  any  rate,  the  risk  asssociated  v/ith  labor  may 
be  expressed  by  the  following  function: 

RSKLBR  =  g(LBRDEP,  L3RC0M)  (64) 

where  LBRDEP  is  labor  dependency  ratio  and  L3RC0M  is 
labor  competitive  position. 

Risk  associated  with  liquid  assets.   The  pos- 
sibility that  liquid  assets,  i.e.,  cash  and  marketable 


securities,  fail  to  protect  the  firm  from  possible  adverse 

deviatio:-is  of  net  cash-flow  from  its  expected  value  has 

27 

long  been  considered  as  risk  of  technical  insolvency. 

It  has  also  been  established  that  the  larger  is  the  pro- 
portion of  liquid  assets,  as  well  as  current  assets,  to 
total  assets,  the  smaller  is  the  risk. 

Also,  the  general  relationship  between  the  risk  of 
technical  insolvency  and  the  proportion  of  current  assets 
v.'hich  is  financed  by  short-terra  debt  is  indisputable.  A 
firm  that  finances  its  current  assets  entirely  with  equity 
capital  will  have  less  need  for  liquidity  than  it  would  if 
it  financed  these  assets  entirely  with  short-term  borrow- 
ings.  Thus,  for  every  level  of  current  assets,  such  firm 
bears  less  risk  than  otherwise. 

To  these  elements  should  be  added  the  maturity 
dimension  of  debts.   It  is  obvious  that  the  larger  is  the 
dispersion  of  the  maturity  of  debt,  the  less  is  the  risk 
of  technical  insolvency.   This  is  so  because  the  long 
intervals  bet\-/een  principal  payments  of  debt  give  more 
time  to  the  firm  to  earn  cash  or  to  convert  an  asset  into 
money  in  sufficient  amounts  to  meet  its  obligations. 

The  maturity  data  alone,  however,  may  not  help 
very  much  because  it  may  happen  that  in  spite  of  seemingly 
sufficient  dispersion  of  maturities,  the  firm  is  still 
unable  to  pay  the  principals  of  the  loans  due  to  their 
large  sizes.   It  may,  therefore,  be  useful  to  consider  the 
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average  sir.c'  of  payments  in  relation  to  liquid  assets  in 
conjunction  with  other  elements  discussed  above.   To 
sumjT'.arize ,  the  above  relations  m.ay  be  expressed  by  the 
following  function: 

RSJli2l  ---  h  (li^a^vST/TA,   CA/TA,   CA/CL,   NIATDSP,    SIZPAY/IX:D>"^T)         (65) 

where  LQDAST  stands  for  liquid  assets,  TA  for  total  assets, 
CA  for  current  assets,  CL  for  current  liabilities,  MATDSP 
for  maturity  dispersion  of  debts  and  SIZPAY  for  average 
size  of  principal  payments  of  debts. 

As  was  discussed  above,  except  for  the  last  ratio 
in  equation  (65),  all  other  elements  have  risk-reducing 
effects  as  they  become  larger.   Conversely,  the  larger 
becomes  the  size  of  principal  payments  in  relation  to  the 
size  of  liquid  assets,  the  larger  becomes  the  risk. 

It  may  also  be  helpful  to  supplement  (65)  with  some 
additional  information.   That  is,  to  provide  a  series  of 
probability  estimates  indicating  the  possibility  that  any 
of  the  elements  in  (65)  v;ill  be  at  or  below  a  disaster 
level,  given  their  related  frequency  distribution  over  a 
period  of  time. 

For  this  purpose,  calculate  the  mean  value  and 
variance  of  each  of  the  above-mentioned  elements  for  a 
company.   Also  estimate  the  disaster  level  of  any  such 
element  by  averaging  it  over  all  the  recently  failed 
firms  in  the  industry.   Then,  by  utilizating  Tchebychef f ' s 
inequality,  and  using  the  mean,  variance,  and  the  estimated 
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disaster  level  of  each  eJ.ement  above,  there  is  obtained  the 
related  probability.   The  interpretation  of  the  result  ob- 
tained in  this  way  is  straightforward   and  is  derived  frooi 
Tchebychef f ' s  inequality.   For  example,  in  the  case  of 
acid-test  ratio, 

P(LQDAST/TA  $  d  )  <  a^/ (E  "  d^ ) ^  (66) 

indicates  that,  given  the  mean  and  variance  of  the  ratio 
over  a  period  of  time,  the  probability  that  the  ratio 
falls  below  the  estimated  level,  d,  ,  is  equal  to  or  less 
than  the  right-hand  side  ratio.   A  simpler  interpretation 
may  be  obtained  by  inverting  risk-reducing  ratios,  but 
the  result  would  not  change.   Denoting  the  probabilities 
obtained  in  this  way  for  the  elements  in  (65)  by  P(l), 
P(2),  P(3),  P(4)  and  P(5),  respectively,  produces : 

RSKLQ2  =  i  [P(l),  P(2),  P(3),  P(4),  P(5)]    (67) 

And  the  risk  associated  with  liquid  assets  would  be  a 
function  of  the  parts  obtained  from  equations  (65)  and 
(67).   Thus: 

RSKLQD  =--  j(RSKLQl,  RSKLQ2)  (68) 

Risk  associated  with  securities  (RSKSEC) .   In 
firms  such  as  financial  institutions  or  securities  deal- 
ers, where  securities  other  than  the  marketable  ones 
(which  were  accounted  for  in  liquid  assets  discussion) 
constitute  a  significant  part  of  total  assets  of  the  firm. 
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the  risk  associated  with  their  collective  perforraance  plays 
an  important  role  in  determining  the  overall  risk  exposure 
of  the  firm. 

Finding  such  risk  does  not  seem  difficult  after 
so  m.uch  progress  has  been  m.ade  m  the  portfolio  field.   So 
far,  the  variance  of  the  return  on  the  securities  portfolio 
has  been  chosen  as  a  measure  of  risk.   But  to  obtain  it, 
it  is  not  necessary  to  resort  to  the  sophisticated  model 
of  portfolio  variance.   Use  can  be  made  of  the  historical 
data  of  the  return  on  each  security  in  simple  formulas. 
First  calculate  the  return  on  the  portfolio  in  each  period 
by  the  follo'.ving  equation: 


R(P)^  =       Y.       X.R.  (69) 

t    i=l   ^  ^ 


Where  R(P)   is  return  on  portfolio  at  period  t,  n  is  total 
number  of  securities,  X.  is  the  proportion  invested  in 
security  i  and  R.  is  return  on  security  i. 

Then  calculate  the  mean  value  of  the  portfolio 
return  over  the  period: 

~    1   "^ 
R{P)=  ^  I      R(P).  (70) 

^  t=l      ^ 

where  T  is  total  number  of  periods  under  consideration. 

The  variance  is  then  obtained  by: 

T 
V(P)  =  ~  Z       [R(P).  -  R(P)]^  (71) 

^  t-1 
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And  the  coefficient  of  variation  of  the  portfolio  by: 

CV(P)  =  V(P)/rTf)  (72) 

For  present  purposes,  consideration  of  the  port- 
folio variance  alone,  however,  does  not  seein  to  be  suffi- 
cient.  It  serms  useful,  therefore,  to  supplement  equation 
(72)  with  a  measure  of  skewness  as  well  as  rate  of  growth 
in  return  on  the  portfolio. 

The  relative  skewness  may  be  measured  by  stan- 
dardized third  inonient  about  the  mean,  as  discussed  before: 

1  1  T 

a^  =  m^  /  V(P)^  =  1A(P)^  •  VT  Z   [R(P),  -  RlP)]^       (73) 
■^    -^  t=l     ^ 

And  the  rate  of  grcv/th  of  return  is  obtained  by 
regressing  the  natural  log  of  the  return  on  portfolio  in 
each  period  against  time,  and  utilizing  the  following 
equation  resulted  from  such  regression  analysis: 

In  R(P)   =  In  A  +  rt  (74) 

where  In  R(P)  is  the  natural  log  of  the  portfolio  return 
at  time  t.  In  A  is  y-intercept  of  the  line,  and  r  is  the 
growth  rate. 

In  evaluating  the  risk  of  portfolio,  the  coef- 

2 
ficient  of  determination,  R  ,  obtained  from  the  regression 

analysis,  is  also  used  along  with  other  factors,  because 
it  contains  some  additional  information  about  the  disper- 
sion of  the  portfolio  return. 
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The  final  point  to  be  rr.ade  is  that,  since  portfolio 
nagc:nent  is  a  dynamic  process  in  nature,  it  cannot  be 
assumed  that  the  contents  and  the  size  of  the  portfolio  do 
not  change  over  a  period.   Thus,  the  risk  associated  with 
the  portfolio  also  changes  over  a  period.   In  order  to 
evaluate  the  risk,  however,  some  attempt  to  treat  the 
problem  must  be  made.   One  possible  way  is  to  take  the 
weighted  average  of  all  the  factors  determining  the  risk 
over  the  last  period.   The  weights,  however,  would  be  two 
dim.ensional.   The  first  dimension  relates  to  the  propor- 
tional size  of  each  portfolio  variant.   The  second  one 
relates  to  the  proportion  of  period  over  which  the  port- 
folio variant  was  held  by  the  firm.   As  an  exam.plc,  the 
weighted  average  of  portfolio  variance  is  obtained  by  the 
following  equation: 


vTp)  =   ^  V(P).S.T 
i-1 


(75) 


where  n  is  total  num.ber  of  the  portfolio  variants,  V(P)^ 
is  the  portfolio  variance  of  the  i-th  variant,  S^  is  the 
proportional  size  of  the  i-th  variant,  and  T^  is  the 
proportional  period  (a  fraction  of  365  days,  for  example) 
over  which  the  i-th  variant  has  been  held  in  the  last 
period.   Thus,  the  risk  associated  with  securities  is  a 
function  of  the  following  factors: 

RSKSEC  -  [cV(r) ,  ¥3,  F,  r"]  (76) 


where  the  br.vs    indicate  the  weighted  average  of  the  related 
factors. 

Risk  associated  with  credit  extending  (RSKCRD) . 
Firms  may  extend  credit  to  otiier  firms  or  individuals 
through  either  Joan  or  accounts  receivable.   The  incentive 
is,  of  course,  to  increase  the  sales  and  profits.   Whether 
in  the  form  of  loan,  accounts  receivable,  or  any  other 
form,  credit  extension  carries  with  itself  some  risk. 
The  risk  associated  with  credit  extension  is  the  possi- 
biliry  that  the  actual  result   from  such  action  deviates 
from  what  is  expected. 

This  risk  has  two  facets:  1)  The  extension  of 
credit  is  based  upon  the  expectation  that  it  increases 
sales  and  profits,  and  2)  It  is  expected  that  the  credit 
receivers  do  not  default  in  repaying  the  entire  credits 
extended  to  them.  Thus,  the  larger  is  the  deviation  of 
actual  results  from  such  expectation,  the  larger  is  the 
related  risk. 

It  is  not  easy,  however,  to  find  out  whether  the 
expectation  of  increasing  sales  and  profits  as  a  result 
of  credit  extension  is  met  or  not.   Nevertheless,  a  way 
should  be  found  to  estimate  its  associated  risk.   One 
possible  way  seems  to  be  the  determination  of  the  correla- 
tion between  credits  and  profits  over  an  appropriate 
period  of  time.   The  correlation  coefficient  can  be  cal- 
culated by  the  following  equation: 
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COR  (MBIT,  CRDT)    -  i— _.-     z    (NBIT  -NBIT)  (CPJYT  -CrOT)  P  (77) 

'^NcIT^CrNDT  t=l 

where  the  left-hand  term  denotes  the  correlation  coeffi- 
cient betv;eeu  net  incciTie  before  interest  and  taxes  and 
total  credit,  and  on  the  right  T  is  the  number  of  periods 
under  consideration,  NBIT   is  net  income  before  interest 
and  taxes  in  period  t,  NBIT  and  '^j^gj-p  ^^^    the  mean  and 
standard  deviation  of  the  incoi'.ie  over  T  periods,  CRDT^  is 
total  credit  extended  in  period  t,  CRDT  and  c^j^p,^  are  the 
mean  and  standard  deviation  of  credit  over  T  periods,  and 
P   is  the  joint  probability  of  the  occurrence  of  NBIT 
and  CRDT   at  time  t. 

A  positive  correlation  coefficient  may  indicate 
that  actual  performance  of  credit  has  corresponded  to  what 
has  been  expected.   Conversely,  a  negative  coefficient  may 
signal  the  deviation  of  actuality  from  expectations.   The 
larger  is  the  absolute  value  of  the  negative  correlation, 
the  larger  is  the  deviation,  and  the  larger  is  the  risk 
undertaken  by  extending  the  credit. 

If  there  were  not  external  forces  at  work  when 
the  credit  extension  were  changing,  all  the  changes  in  net 
income  could  be  attributed  to  the  variation  in  credit,  but 
at  any  point  of  time  numerous  factors  act  and  react  on 
net  income.   In  particular,  credit  is  usually  eased  at  the 
time  when  the  market  is  weak.   It  is  eased  to  induce 
potential  buyers  to  buy  and  thus  to  prevent  sales  from 
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doclininq.   On  the  other  hand,  wher.  demand  is  hicjh,  there 
is  no  need  to  ease  credit.'   In  sum,  however,  the  correla- 
tion coefficient  seems  to  be  a  useful  n\easure  in  estimating 
the  risk,  but  reliance  on  tyiis  factor  alone  does  not  seem 
appropriate.   Furthermore,  if  the  correlation  is  positive, 
there  still  is  needed  some  measure  to  deal  with  the  second 
facet  of  the  credit  risk,  the  possible  default  of  the 
credit  recipients.   Recourse  can  be  made,  therefore,  to  a 
set  of  other  determ.inants  which  may  corroborate  the  cor- 
relation coefficient. 

The  possibility  that  the  credit  receivers  may  de- 
fault in  repaying  a  part  or  the  entire  credit  extended  to 
them  is  a  function  of  several  factors.   In  particular, 
if  a  loan  is  a  secured  one,  the  associated  risk  is  usually 
low,  depending  upon  the  nature  of  the  pledge.   On  the 
other  hand,  when  it  is  unsecured,  the  quality  of  the  ac- 
counts and  the  credit  period  affect  the  risk.   The  better 
is  the  quality  of  accounts,  the  lower  is  the  risk.   But 
the  longer  is  the  credit  period,  the  higher  is  the  risk. 

The  quality  of  accounts  carried  by  the  firm  may 
be  approximated  by  the  bad-debt  losses  (in  percentage  of 
annual  sales)  and  the  average  collection  period.   The 
firms  v.'ith  easy  credit  policy  have  both  higher  bad-debt 
losses  and  longer  collection  periods,  whereas  the  reverse 
is  true  for  the  firms  with  tighter  credit  policy. 
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Another  feature  of"  a  loan  v;hich  ha,?  an  effect  on 
risk  is  its  callability  (or'  withdrawal  agreement).   This 
feature  gives  the  lender  firn  more  flexibility  in  finan- 
cing its  fund=^  requirements  and  in  adjusting  their  related 
interest  rates. 

In  surriraary,  the  risk  associated  with  credit  ex- 
tending may  be  expressed  by  the  following  function: 

RSKCRD  =  1  [COR(NBIT,CRDT) ,  UNSEC/CRDT,  BADDBT/ 

SALES,  AVGCRP,  ACGCLP ,  CALL/CRDT]      (78) 
where  UNSEC  stands  for  unsecured  credit,  BADDBT  is  for 
bad-debt  losses,  AVGCRP  for  average  credit  period,  AVGCLP 
for  average  collection  period,  and  CALL  for  total  callable 
credit  or  loan. 

Risk  associated  with  inventories  (RSKINV) .   In- 
ventories provide  a  very  important  link  between  purchase 
and  sale  or  production  and  sale.   If  inventories  are  not 
in  sufficient  and  balanced  am.ounts,  either  customers  are 
hurt  and  sales  deteriorate,  or  production  is  hindered 
and  resources  lie  idle.   On  the  other  hand,  if  inven- 
tories are  in  excess  of  what  are  actually  needed,  there 
would  be  a  lot  of  losses  involved  in  carrying  costs, 
opportunity  costs,  possible  obsolescence,  price  changes, 
theft,  and  spoilage. 

The  risk  <-iSsociated  with  inventories  is,  then,  the 
possibility  that  actual  amounts  deviate  from  what  is 
required.   This  possibility  increases  as  the  lead  time 


9  3 


required  to  receive  delivery  of  inventory,  once  an  order 
is  placed,  is  lengthened.   Existence  of  monopoly  on  the 
side  of  the  supplier  of  raw  raaterials  also  adds  to  the 
risk  of  inventory.   Conversely,  monopsony  power  of  the  firm 
in  buying  the  supplies  has  risk-reducing  effect. 

Despite  the  constant  efforts  of  theoreticians  and 
practitioners  to  deteriaine  the  optimal  level  and  combina- 
tion of  inventories,  deviations  of  actuals  from  required 
do  occur  in  practice.   Inventory  turnover  ratio  is  usually 
used  to  measure  such  deviations.   A  high  turnover  ratio 
may  indicate,  among  other  things,  that  the  stock  of  in- 
ventory is  low-   Similarly,  overstock  may  be  shown  by  a  low 
turnover  ratio.   But  it  is  difficult  to  indicate  that  the 
turnover  ratio  of  a  firm  is  high  or  low,  unless  there  is  a 
criterion  to  be  compared  v/ith. 

One  way  to  meet  this  purpose  is  to  assume  that  a 
large  number  of  similar  firms  in  an  industry  which  are 
operating  successfully  have  been  using  a  correct  level  of 
inventories.   Thus,  averaging  the  ratio  over  such  firms 
gives  a  criterion  for  comparison.   Therefore,  for  any 
similar  firm,  deviation  of  its  ratio  from  this  amount  in 
the  last  period  is  an  indic^^tion  of  incorrect  level  of 
inventory  for  the  coming  period.   Thus,  a  large  deviation 
indicates  a  high  risk,  and  a  small  deviation  signifies  a 
low  risk.   To  summarize,  the  risk  associated  with  inven- 
tories is  a  function  of  the  following  com.pcnents : 
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RSKINV  =  ni(rF.VTNV,  SUPCOM,  LEADTM)  (79) 

where  DEVINV  is  the  deviation  of  inventory  turnover  ratio 
of  the  firir,  from  the  averag',:  ratio  of  the  successful  firms 
in  the  sane  period,  SIJPCOM  is  the  corrrpetitivo  position  of 
the  suppliers  or  the  firm  (whichever  is  more  important  to 
the  firm  in  determining  its  risk)  with  respect  to  supplies 
and  raw  materials,  and  LEADTM  is  for  lead  time. 

Risk  associated  with  fixed  assets  (RSKFA) .   The 
risk  associated  with  fixed  assets  is  the  possibility  that 
the  actual  services  renderable  by  these  assets  deviate  from 
what  is  expected.   The  sources  of  such  deviations  vary. 
They  may  com.e  from  technological  advancements  v;hich  render 
the  assets  obsolete,  or  they  may  be  due  to  untimely  physical 
depreciation.   They  may  also  come  through  inflation.   In- 
flation makes  the  funds  allocated  through  accounting 
depreciation  insufficient  to  restore  the  assets'  earning 
power. 

The  longer  is  the  life  of  the  fixed  assets,  the 
more  is  the  possibility  of  their  obsolescence  and  untimely 
depreciation,  and  thus  the  higher  is  the  risk  of  losing 
their  earning  power.   Conversely,  the  shorter  is  the  life, 
the  lower  is  the  risk.   Also,  the  higher  is  the  average 
inflation  rate  over  the  period  of  tim.e  under  consideration, 

the  higher  is  the  possibility  that  the  accounting  de- 

2  8 
preciation  deviates  from  the  economic  depreciation   of 
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the  assets,  and  thus  the  higher  is  the  risk  associated  with 
the  assets. 

Another  factor  which  may  have  some  effects  on  the 
assets  risk  is  the  dispersion  of  the  times  at  which  the 
economic  lives  of  major  items  of  the  fixed  assets  expire. 
The  smaller  is  this  dispersion,  tlie  more  difficult  it  is 
to  replace  several  useless  major  items  in  a  short  period 
of  time.   This  may  affect  the  earning  power,  and  thus 
creates  more  risk.   Conversely,  the  larger  is  the  disper- 
sion, the  lower  is  the  risk.   The  above  discussion  can  be 
summarized  by  the  follov;ing  function: 

RSKFA  =  n(LIFE,  INFLAT,  DISPER)  (80) 

where  life  is  the  economic  life  of  the  fixed  assets, 
INFLAT  is  the  average  rate  of  inflation  over  the  period 
under  consideration,  and  DISPER  is  the  dispersion  of  times 
at  which  the  economic  lives  of  major  items  of  the  fixed 
assets  are  expired. 

Risk  associated  with  patents  (RSKPAT) .   The  value 
of  patents,  goodwill,  franchises,  and  memberships  is  sub- 
ject to  changes.   The  possibility  that  their  actual  values 
deviate  from  what  were  expected  constitutes  the  associated 
risk.   The  deviation  may  be  the  result  of  technological 
progress,  changes  in  market  factors,  or  changes  in  law  and 
regulations.   At  any  rate,  it  seems  that  the  risk  is  a 
function  of  the  period  that  the  asset  has  been  in  actual  use 
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(PRINUS).  In  the  case  of  a  patent,  for  example,  it  seems 
that  the  longer  it  has  hocn    in  use,  the  higher  is  the  prob- 
ability that  som-othing  similar  or  even  better  will  be  in- 
vented.  Tlius: 

RSKPAT  =  o(PPINUS)  (81) 

Risk  associated  with  debt  (RSKDBT) .   Much  has  been 
said  about  financial  leverage  and  financial  risk.   For 
present  purposes  risk  associated  with  debt  is  defined  as 
the  possibility  that  the  actual  ability  of  tlie  firm  to  pay 
its  debt  obligations,  both  principal  and  interest,  deviates 
from  what  is  expected.   By  increasing  the  proportion  of 
senior  capital  in  the  capital  structure,  a  firm  increases 
its  financial  leverage.   The  higher  is  the  level  of  lever- 
age, the  higher  is  the  possibility  of  its  inability  to  pay, 
other  things  being  equal,  and  vice  versa. 

But,  what  is  the  disaster  level  of  leverage  below 
which  there  is  a  very  high  degree  of  probability  that  the 
firm,  will  face  financial  distress?   Theoretically,  the 
answer,  which,  of  course,  depends  on  the  peculiarities 
of  the  firm  and  of  the  industry  to  which  it  belongs,  has 
not  yet  been  found.   Empirically,  it  can  be  assumed  that 
all  failed  firms  in  the  sam.e  industry  have  had  their  share- 
holders' equity  capital  at  or  below  a  disaster  level. 

Thus,  by  averaging  the  proportion  of  the  equity 
capital  in  the  capital  structure  of  a  11  such  firms 
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(preferably  of  only  thone  firms  which  are  similar  to  the 
firm  under  consideration,  v/ith  respect  to  the  size  and  busi- 
ness nature,  if  their  number  is  large  enough)  and  utilizing 
the  Tchebycheff  inequality,  it  is  possible  to  obtain  an 
approximate  probability  that  the  firm's  equity  capital 
would  be  at  or  below  the  disaster  level, 

P(EQCAF/TA  $  d)  i    a^/(E  -  d)^  (82) 

v/here  EQCAP  is  equity  capital  of  the  firm  v/hose  mean  and 
variance  over  a  period  of  past  years  is  E  and  a    ,    re- 
spectively, TA  is  total  assets,  and  d  is  the  disaster 
level. 

In  addition  to  the  level  of  debts,  their  average 
maturity  has  also  some  effects  on  the  related  risk.   The 
longer  is  the  maturity,  other  things  being  equal,  the  more 
time  is  available  to  the  firm  to  provide  necessary  funds, 
and  thus  the  lower  is  the  risk,  and  vice  versa. 

Not  only  the  maturity  of  debts,  but  the  dispersion 
of  the  paym.ent  dates  also  affects  the  risk.   The  greater 
is  the  dispersion,  the  less  is  the  burden  of  payments  at 
any  point  of  time  and  the  easier  is  the  provision  of  the 
required  funds,  other  things  being  equal.   Conversely, 
smaller  dispersion  imposes  more  risk  on  the  firm. 

If  some  of  the  debts  have  call  feature  in  favor 
of  the  firm,  it  has  a  risk-reducing  effect.   Conversely, 
if  the  call  feature  is  for  the  creditors,  then  the  risk 
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of  the  firm  is  increased.   At  any  rate,  a  measure  of 
weighted  average  of  such  features,  negative  for  cases  in 
favor  of  the  firm  and  positive  for  those  against  the  firm,, 
should  be  included  in  the  evaluation. 

Another  factor  is  accessibility  of  the  firm  to  the 
capital  market.   A  firm  which  has  access  to  the  capital 
market,  in  addition  to  being  able  to  obtain  cheaper  funds, 
faces  less  risk,  because  there  is  less  possibility  of  being 
trapped  financially. 

In  summary,  the  risk  associated  with  debt  is  a 
function  of  the  following  factors: 

RSKDBT  =  p[P(EQCAP/TA  <:  d)  ,  AVGMAT,  PAYDIS, 

CALLFF,  CALLAF,  CAP ACS]  (83) 

where  AVGMAT  is  the  v;eighted  average  debt  maturities, 
PAYDIS  is  a  measure  of  the  dispersion  of  the  payment  dates, 
CALLFF  and  CALLAF  are  the  weighted  average  of  a  measure  of 
call  feature  for  and  against  the  firm,  respectively,  and 
CAPACS  is  the  capital  market  accessibility  of  the  firm. 

Risk  associated  with  changes  in  economic  and 
market  factors  (P.SKECM)  .   The  possible  relation  betw'een 
a  firm's  revenue  and  expenses  with  the  levels  of  some 
important  economic  and  market  factors  in  the  corresponding 
periods  provides  a  means  of  evaluating  the  profitability 
of  the  firm  under  various  conditions,  including  different 
phases  of  business  cycles,  which  seems  to  be  essential  in 
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differentiating  between  firms  which  operate  saccessfully 
and  those  which  fail.   The  possible  effects  on  the  per- 
formance of  the  firm  in  the  next  period  of  future  changes 
in  econoinic  and  market  factors,  hov;eyer,  constitute  an 
additional  risk  factor  of  the  firm. 

Changes  in  the  economic  system  are  predicted  via 

two  general  approaches.   The  first  approach  utilizes  the 

29 

properties  of  some  economic  indicators;    the  second  relies 

30 
on  GNP  models.     The  indicators  herald  the  upturn  or  down- 
turn of  the  economy  but  do  not  predict  the  absolute  values 
of  the  GNP  or  any  other  variable.   GNP  models,  on  the 

other  hand,  predict  absolute  levels  but  do  not  identify 

•  4   31 
turning  points. 

Since  the  possibility  of  a  firm's  future  failure  is 
more  likely  related  to  the  level  of  future  downward  changes 
in  the  general  business  activities  than  to  the  mere  change 
itself,  it  seems  more  appropriate  to  take  advantage  of  one 
or  more  variables  of  a  GNP  model,  rather  than  using  the 
economic  indicators.   Obviously,  no  one  variable  can  be 
equally  helpful  for  all  industries  in  determining  the  risk 
associated  with  changes  in  the  economic  factors.   Whereas 
the  predicted  change  in  government  spending  on  defense 
materials,  for  example,  has  a  great  impact  on  defense- 
related  industries,  it  may  not  have  any  influence  on  non- 
defense  businesses,  and  so  on. 
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Therefore,  tiie  selected  forecasting  variable  (s) 
should  fit  the  industry  under  consideration.   For  conve- 
nience changes  in  such  a  variable  (s)  are  referred  to  as 
AFGBA,  i.e.,  change  in  a  factor(s)  of  general  business 
activity. 

Other  deterrainants  of  the  risk  associated  with 
changes  in  economic  and  ir^arket  factors  are  population 
trends  of  the  county  in  which  the  firm  has  business 
(POPTRN)  ,  changes  in  the  age  coraposition  of  the  population 
(ACPOP) ,  changes  in  the  competitive  forces,  such  as  net 
change  in  the  number  of  competitors  or  predicted  change  in 
the  company's  market  share  (ACOMP) ,  predicted  change  in  en- 
vironmental factors  (ae:NVRN)  . 

In  sumjiiary,  the  risk  associated  with  changes  in 
economic  and  market  factors  is  a  function  of  the  following 
elements : 

RSKECM  =  q(AFGBA,  POPTRN,  ACPOP,  ACOMP,  ALAJ-7,  AJ3WRN)   (8  4) 

In  concluding  the  risk  section,  it  should  be  added 
that  there  may  be  some  additional  risk  factors  which  may 
have  some  effects  on  the  firm's  overall  risk  exposure,  but 
it  seems  that  all  the  important  ones,  at  least  for  present 
purposes,  have  been  included  in  the  previous  discussion. 

Mism.a  nagem  e  n  t 
In  the  previous  two  sections  all  the  factors  af- 
fecting the  first  two  elements  of  equation  (1),  i.e.. 
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ca?h-flow  and  risk,  were  discussed  at  some  Icnqth,   In  this 
section,  the  third  element  of  the  L.;uation,  i.e.,  MISMGT, 
is  discussed,  however,  rather  briefly. 

The  reason  for  such  treatment  of  the  mismanagement 
element  is  the  difficulty  in  generalizing  factors  v/hich  may 
be  cOi'i'jaon  to  all  firms  as  having  effects  in  driving  a  firm 
out  of  business,  yet  do  not  fall  within  the  domain  of  the 
factors  discussed  so  far.   To  be  sure,  nevertheless,  there 
may  be  found  some  factors  which  have  these  characteristics. 

Fraud  is  one  of  such  factors.   It  is,  however,  an 
after-fact  phenomenon.   Its  existence  cannot  be  known  until 
after  it  is  discovered.   Embezzlement  is  another.   Manipula- 
tion, misconduct,  poor  books  and  records,  improper  control, 
too  rapid  expansion,  and  overcomir.itment  in  a  particular 
venture  are  other  such  factors  which  may  also  be  included 
in  the  same  category. 

The  effects  of  these  factors  varies ,  of  course,  indif- 
ferent industries.  Whereas  embezzlement  may  be  a  significant 
factor  in  the  financial  industry,  for  example,  poor  books 
and  records  may  play  a  vital  role  in  securities  industry. 

The  possible  usefulness  of  including  the  mis- 
management element  among  the  potential  determinants  of 
the  probability  of  financial  distress  of  a  firm  may  be 
better  seen  through  an  example  from  the  securities 
industry.   It  has  been  an  observable  fact  in  recent  years 
that  the  probability  of  financial  distress  of  a  brokerage/ 
dealer  firm  is  directly  related  to  fraud  or  back  office 


107 


problems.  In  fact,  the  f re'quencies  with  which  possible 
fraud  or  misconduct  and  poor  books  ^md  records  were  ob- 
served in  the  94  liquidated  iiriTiS  handled  by  SIPC  during 

32 
1971-73  were  55  and  04,  respectively. 

In  suR-'unary,  mismanagement  may  be  expressed  as  a 

function  of  the  following  factors: 

MISMGT  =  r (FRAUD,  EKBEZ ,  MANIPU,  MISCND,  IMPCNT, 

PRBKRC,  RAPEXP,  OVCOMT)  (8  5) 

where  FRAUD  stands  for  fraud,  EMBEZ  for  embezzlement, 
MANIPU  for  manipulation  of  the  various  accounts,  MISCND 
for  misconduct,  IMPCNT  for  improper  control,  PRBKRC  for 
poor  books  and  records,  RAPEXP  for  rapid  exp>ansion,  and 
OVCOMT  for  overcommitment  in  a  particular  venture. 

Summary  and  Conclusion 

In  an  effort  to  present  an  analytic  framework, 
potentially  applicable  to  all  profit-oriented  private  or- 
ganizations, for  identifying  potential  determinants  of  a 
firm's  financial  distress,  it  was  hypothesized  that  the 
probability  of  such  distress  is  a  function  of  three 
elements:   net  cash-flow  per  dollar  of  assecs,  risk  ex- 
posure, and  mismanagement  of  the  firm. 

Because  the  specific  relations  between  these 
elements  and  the  probability  of  the  financial  distress 
could  not  be  known,  the  proposed  general  relation  was 
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expanded  to  include  all  possible  factors  affecting  the 
above-monti onca  elements.   It  was  suggested  that  by  ex- 
posing this  expanded  relation  to  a  statistical  analysis, 
it  may  be  possible  to  obtain  a  specific  relation  between 
some  of  the  factors  v/hich  are  more  important  than  others  in 
determining  the  probability  of  the  financial  distress  of  a 
firm  in  a  specific  industry. 

In  identifying  the  factors  which  affect  a  firm's  net 
cash-flow  per  dollar  of  assets,  the  revenue  side  and  the 
expense  side  of  the  firm  were  treated  separately.   The  ef- 
fects of  all  internal  as  well  as  external  factors  on  both 
revenue  and  expenses  were  then  exam.ined  individually.   The 
result  was  the  emergence  of  a  set  of  general  relations 
numbered  (4)  through  (11)  . 

The  treatment  of  the  risk  element  w'as  organized  by 
first  adopting  a  definition  of  risk.   It  was  defined  as 
the  possibility  that  actual  outcome  of  a  random  variable 
will  deviate  from  that  which  is  expected.   Then,  variance 
or  standard  deviation  of  a  single  variable,  semi-variance, 
third  moment,  rate  of  growth,  confidence  level,  covariance, 
correlation  coefficient,  variance  of  a  group  of  variables, 
and  the  probability  of  disaster  level  were  discussed  as 
possible  measures  which  could  individually,  or  in  combina- 
tion, determine  the  extent  of  a  risk  associated  with  any 
factor.   Finally,  risk  factors  which  collec^iively  determine 
the  risk  exposure  of  the  firm  were  examined  in  detail. 
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Included  wore  risks  as^^ociatc-d  with  cash-f l^^w,  product 
lines,  pieces  occupied,  aqe  of  the  fir'ni,  expenses,  liquid 
assets,  credits,  inventories,  fixed  assets,  paterits,  bor- 
rov.'ing,  and  clianges  in  economic  and  market  factors. 

The  last  section  di  scussed  brief  ly  the  suitability 
of  including  in  cur  analysis  some  other  factors  which  do 
not  fall  in  tl-.e  domain  of  the  other  two  elements  dis- 
cussed above  but  may  have  some  impact  on  the  possibility 
of  a  firm's  financial  distress.   This  resulted  in  the 
MISMGT  relation  numbered  (85) . 

Now,  by  replacing  the  subordinate  functions  into 
the  secondary  ones,  and  the  secondary  ones  into  the 
origin3l  one,  there  is  obtained  the  expanded  form  of  equa- 
tion (1)  that  contains  all  of  the  factors  which  possibly 
have  effect  on  the  probability  of  a  firm's  financial  dis- 
tress.  It  is  this  last  function  which  is  subject  to  an 
empirical  test  to  be  conducted  by  using  multivariate 
discriminant  analysis  technique. 
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CHAPTER  III 
MULTIVARIATE  DISCRIMINANT  A^^ALYSIS 

Introduction 

Multivariate  discrimiuant  analysis  (MDA)  is  a 

statistical  technique  which  can  be  used  to  describe  the 

differences  botv/een  two  or  more  groups  and  to  classify  an 

individual  into  one  of  several  populations  based  on  a  set 

of  characteristics  v/hich  are  observable  in  che  members  of 

the  groups.   More  formally,  suppose  that  there  are  k  groups 

or  populations,  tt  ,  rr  ,  ...,  n    ,    v.'ith  a  m.ean  vector  of  p 

measurem.ents ,  p  ' .  =  (v  ■  -,  i    y  ■  -,  f  ...;!.  )  ,  for  i  -    1 ,  2  ,  .... 
1      ila.^       ip 

k.   The  problemi  to  be  solved  by  MDA  is  to  m.easure  the 
distance  between  these  groups  and  to  assign  an  individual 
with  a  vector  of  measurements  x'  =  (x, ,  Xp ,  ...,  x  )  to 
one  of  those  k  populations  to  which  it  has  the  closest 
sim.ilarity  vjith  respect  to  such  measurements. 

In  practice,  however,  the  m.ean  vector  of  popu- 
lations is  not  known  and  should  be  estimated  by  samples 
taken  from,  such  populations.   In  fact,  the  important 
characteristics  of  the  populations  are  unknown,  as  well. 
The  MDA  technique  can  be  used  in  such  a  way  that,  by  using 
a  sample  from  each  populatioii,  the  significant  features  of 
the  populations  are  exti acted  first  and,  then,  a 
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classification  rule  is  established  on  the  basis  of  those 
features.   The  procedure  of  MDA  m,r;y  bo  better  illustrated 
by  using  an  example  given  by  Duda  and  Hart,   although  in  a 
"pattern  classification"  context. 

Consider  a  lumber  rrdll  which  v/ants  to  automate  the  pro- 
cess of  sortincf  finished  lumber  according  to  species  of  tree. 
A  system  to  perform  this  task  night  have  the  form  shov/n  in  Fig- 
ure 1.   The  camera  passes  the  data  to  the  feature  extractor. 
The  latter  selects  and  measures  some  of  these  features  and  pas- 
ses them  to  the  classifier.   The  classifier  then  evaluates  the 
presented  evidence  and  mades  a  decision  about  the  lumber  type. 


CTO.  j=^  CLASS.P.ER  ^^   DECS.O: 


FIGURE  1.   A  PATTERN  CLASSIFICATION  SYSTEM. 

Source:   Duda  and  Hart's  book  (see  note  1).   Reproduced  by 
permission. 

In  applying  the  MDA,  the  work  of  a  researcher  may  be 

likened  to  that  of  the  transducer  in  the  above  system. 

The  researcher  considers  all  of  the  relevant  features 

and  properties  which  may  contribute  to  distinguishing 

between  groups.   He  then  makes  them  available  to  the  MDA. 

A  part  of  the  £inalysis  performs  the  task  of  the  feature 

extractor.   This  is  done  by  a  stepwise  procedure  in  which 

the  "best"  variables  arc  selected.   Based  on  the  selected 

variables  one  or  more  discriminant  functions  are  constructed 
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and  a  classification  rule  is  established  to  act  as  the 
classifier  in  the  above  system.   These  features  and  rules 
are  then  used  to  classify  a  now  observation  into  an 
appropriate  group. 

The  iiiqhly  desirable  features  of  xMDA  have  rapidly 
expanded  its  application  from  its  initial  use  in  the  fields 
of  anthropology  and  plant  taxoncny  to  such  other  diverse 
fields  as  biology,  medicine,  education,  psychology,  and 
economics.   Yet,  all  its  potentialities  have  not  been 
exhausted,  and  it  is  expected  tliat  its  application  v;ill 
extend  to  a  wide  variety  of  new  fields  in  the  near  future. 

It  is  congruent  with  these  features  that  the 
present  study  attempts  to  apply  the  MDA  technique  to  the 
problem  of  identifying  the  factors  affecting  the  financial 
distress  of  a  securities  brokerage/dealer  firm  and  predict- 
ing its  performance,  in  terms  of  potential  bankruptcy  or 
ncnbankruptcy ,  for  a  sufficiently  long  period  of  time  in 
advance . 

The  purpose  of  this  chapter  is  to  present  an 
overall  picture  of  the  MDA  technique  in  application  in  a 
summary  form.   The  mathematical  form.ulat ions  are  mentioned 
without  trying  to  show  their  derivations,  and   the  related 
theories  are  discussed  only  up  to  a  point  where  they  have 
direct  bearing  on  the  understanding  of  the  application. 
A  section  on  the  historical  development,  however,  provides 
sufficient  references  for  a  reader  who  wislies  a  deeper 
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knowlcdqe  on  the  subject.   A  discussion  of  the  general 
assunptions  gives  an  idea  about   the  conditions  under  wnic: 
the  results  are  ir.ore  reliable.   Relation  to  rearession 
analysis,  relative  importance  of  variables,  reliability 
tests,  and  the  available  computer  programs  are  contingent 
topics  to  be  discussed.   The  chapter  concludes  after  a 
brief  survey  on  the  application  of  MDA  in  the  financial 
area . 

General  Assumptions 

The  MDA  technique  is  based  on  the  f ollov.'ing  three 
general  assumptions: 

(1)   The  populations  are  identifiable  and  can  be 
distinguished  from  each  other.   In  fact,  some  authors  re- 
quire that  the  populations  be  mutually  exclusive  and 
exhaustive.   Examples  of  such  populations  are:   good  and 
bad  credit  risks;  successful  and  unsuccessful  salesm.en; 
bankrupt  and  nonbankrupt  firms;  diabetic  and  nondiabetic 

persons;  and  male  and  female  groups.   The  larger  is  the 

2 

distance  between  tv.'o  populations   or  the  smaller  is  the 

overlap  between  the  probability  distributions  of  t'he 
populations  with  respect  to  some  comim.On  features,  the  more 
suitable  is  the  application  of  the  MDA  technique  and  the 
smaller  is  the  probability  of  misclassi fication  of  an  indi- 
vidual, and  vice  versa. 
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(2)  Ecich   iriombcr    of    a    popul  i.tion    can    be    charac- 
terized  by    a    vector    of    roasurable    at-_rib/utcs    x'    --     (x,  ,    x„  , 

1    i- 

...,  X  ),  v.'hore  p  is  the  nur^loor  of  attributes.   Also,  the 
p  ^  ' 

subject  populations  are  differentiated  from  each  other  by 
the  vector  of  the  means  of  such  attributes  in  each  popu- 
lation, 'j'^  =  ^'-'il'  ■'^12'  •■•'  *'ip^  '  ^°^"  ^  "  ^'    ^'     •••'  ^' 

(3)  The  attributes  x.,  j  =  1,  2,  ...,  p,  are 

distributed  according  to  a  continuous  multivariate  normal 

3 

law  j.n  each  of  the  populations.    In  practice,  however, 

measurement  is  too  primiitive  to  yield  a  continuous  scale. 
Furthermore,  there  are  many  situations  in  v^/hich  the  vari- 
ables are  dichotomous  and  the  data  may  take  the  form  of 
the  existence  versus  nonexistence  of  a  condition.   Under 
certain  conditions,  i.e.,  equality  of  variance-covariance 
matrices  and  largeness  of  the  discriminating  variables,  the 
violation  of  this  assumption  does  not  invalidate  the  MDA 
technique.   By  means  of  computer  programs,  Gilbert,  for 
example,  concluded  that  ".  .  .as  the  number  of  variables 
increases,  Fisher's  linear  discriminant  function,  which 
utilizes  only  bivariate  marginals,  should  remain  fairly 
stable.  .  .  ."'^ 

Many  researchers  have  used  Fisher's  LDF  in  situ- 
ations where  non-normal  and  qualitative  variables  were 

5 
present.    They  have  justified  its  usage  by  the  fact  that 

Fisher  has  shown  that  this  function  m.aximizcs  the  between 
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sum  of  squ::ros  relative  to  the  wjthin  suir,  of  squares, 
regardlc>ss  of  the  distribution  of  x. 

Historic a_l_  Development 

Many  v.'riters  attribute  tlie  development  of  the 
multivariate  discriminant  analysis  Lo  R.A.  Fisher,  who 

formally  published  the  concept  aiid  the  application  of  the 

7 
"linear  discriminant  function"  in  1936.    The  idea  of  das- 
criminating  betv;een  multivariate  populations,  however, 
could  be  traced  back  to  1920.   In  this  year,  Karl  Pearson 
was  busy  in  developing  a  "measure  of  distance,"  called  C  , 
betv/oen  tv;o  populations  in  an  analysis  of  anthropometric 

data.^   But  the  first  work  on  his  coefficient  of  racial 

9 
likeness  v/as  published  by  Miss  Tildesly   in  1921.   Durxng 

the  period  1921-1936  several  noticeable  progresses  were 

made  in  this  field:   Mahalanobis    developed  his  measure 

2  11 

of  distance,  called  D  ,  Hotelling    generalized   student  s 

2  12 

t,"  called  T  ,  and  Neyman  and  E.S.  Pearson    presented  the 

theory  of  testing  hypotiiesis. 

According  to  Girshick,    the  developm.ent  of  dis- 
criminatory analysis  started  wit'i  a  Pearsonian  stage, 
follov;ed  by  a  Fisheriar  and  a  Neym.an-Pearson  stages,  and 
led  to  a  Waldian  stage.   Up  to  about  1950,  the  field  bene- 
fited from  the  influential  works  of  only  a  few  writers, 
whose  numijer  were  not  comparable  to  those  in  the  later 
periods.   Gradually,  however,  this  branc'n  attracted  m.any 
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rosearchors  fro-",  diffcr-.-^nt  di  scipl  ipe?  .   Aru'i  their  con- 
tributions flooded  the  literature  under  a  variety  of  names 
such  as  prediction,  allocation,  identification,  selection, 
classification,  pattern  recognition,  and  discrimination. 

In  fact,  attraction  of  the  field  soon  passed  the 
border  of  individual  researchers  and  captured  the  attention 
of  many  scientific  organizations.   NATO  Science  Committee, 
for  example,  has  been  supporting  many  research  works  in 
this  area  for  several  years  and  two  of  the  following- 
referenced  papers"'"   are  among  another  16  v/hich  comprised 

the  proceedings  of  the  NATO  Advanced  Study  Institute  on 

15 
Discriminant  Analysis  and  Applications,    held  in  Athens, 

Greece,  during  June  19  72. 

The  limitation  of  space  in  the  present  study 

precludes  making  individual  references  to  such  early  and 

later  works.   A  practical  alternative,  however,  is  to  name 

the  surveys  and  bibliographical  works  which  have  been  made 

in  this  regard,  so  far.   The  first  survey  of  discriminatory 

analysis  was  made  by  Hodges-*-^  in  1950.   It  was  sponsored  by 

the  USAF  School  of  Aviation  Medicine  and  distributed  in  a 

mimeograph  form.   In  1962,  Posten  prepared  a  bibliography 

on  classification,  discrimination,  generalized  distance, 

and  related  topics."'''^   This  work  was  also  rrimeographed . 

A  comprehensive  printed  bibliography,  although  in  the 

general  field  of  the  multivariate  statistical  analysis, 


become  available  in  1972.   It  was  prej:v=ired  by  Anderson,  Das 
Gupta  and  Styan. 

More  recent-ly,  Das  Gupta  has  presented  a  compre- 

19 
hensivo  roviev;  on  classit  ication .     Ho  lias,  however, 

concentrated  on.  major  theoretical  papers  and  deliberately 
excluded  many  apoiied  papers  and  computer  programs.   In- 
cluded in  his  review  is  a  fai.rly  extensive  bibliography 
arranged  by  subject  m>atter  and  then  chronologically  within 

subject  ruatter.   The  last  work  in  this  respect  is  that  of 

20 
Cacoullos  and  Styan.     It  is  an  almost  exhaustive  bibli- 
ography of  discriminant  analysis  for  the  time  period 
covered.   It  includes  a  list  of  547  research  pap^ers  and  26 
books  which  v.'ere  published  up  to  and  including  1971  and 
1972,  respectively. 

Finally,  m.ention  should  be  made  of  the  bibliogra- 

21 

phy  prepared  by  Toussamt.     It  is,  however,  on  estimation 

of  miisclassif ication. 

Test  of  Equality  of  Dispersion  Matrices 

Before  any  attempt  is  initiated  to  compute  a  dis- 
criminant function,  having  knowledge  about  the  equality 
of  dispersion  matrices  of  the  sampled  group  is  the  first 
order  of  im.portance .   If  these  m.atrices  are  equal,  the 
proper  discriminant  function  is  a  linear  one.   Otherwise, 
quadratic  classification  procedure  should  be  used. 
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The  effect  of  the  d i spei-.s ion  heterogeneity, 
however,  differs  from  case  to  case.   In  qeneral,  the  extent 
of   the  bias  introduced  into  the  result  by  ignoring  the 
inequality  depends  upon  the  degree   of  the  heterogeneity 
of  the  dispersion  matrices,  the  number  of  variables  in  the 

function,  the  number  of  groups,  and  the  size  of  the  sample. 

2 

Under  the  title  "The  Robustness  of  Hotelling's  T  ,"  and  by 

using  Monte  Carlo  methods  for  generating  sample  data, 

22 
Hollov;ay  and  Dunn    studied  somic  of  these  factors  in  the 

two-group  case.  They  presented  their  results  in  several 

graphs  and  showed  that  in  the  presence  of  dispersion 

inequality,  the  null  hypothesis  stating  that  two  vector 

means  are  equal  is  accepted  more  frequently  by  the  signifi- 

23 

cant.  test.  '   As  a  matter  of  illustrating  the  point,  some 

of  their  interesting  graphs  are  reproduced   here.   Their 
notations  have  been  defined  as  the  following:   p  is  nuniber 
of  variates,  N,  and  N„  are  sample  sizes  of  groups  1  and  2, 
r  =  N,/(N,  +  N„)  ,  E -,  and  Z^  are  covariance  matrices  of  the 
two  populations,  and  d  is  equal  eigenvalues  of  the  matrix 

^2^1   • 

Figure  2  shows  that,  for  each  number  of  variates, 

as  the  sam^plo  size  decreases,  the  actual  level  of  signifi- 
cance (ct)  increases.   This  effect  is  more  pronounced  when 
the  number  of  variates  increases  from  p  -  1  to  p  =  10.   In 
other  words,  as  the  number  of  variates  inci eases,  the 
sam.ple  necessary  for  the  test  to  be  relatively  robust 
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FIGURE    2.       HOTELLING'S    T    ;    ONE    MINUS    SIGNIFICANCE    LEVEL   AS 
A   FUNCTION   OF    SA:4PLE    SIZE 
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FIGURE  3.   HOTELLING'S  T  ;  ONE  MINUS  SIGNIFICANCE  LEVEL 
AS  A  FUNCTION  OF  d  FOR  SEVERAL  VALUES  OF 


=  N-j_/(Nj_+N2) 


1  -  a  =  0.95;  N,+N2  =  50 


Source  of  both  figures:   Holloway  and  Dunn's  article  (see 
note  22) .   Reproduced  by  permission. 
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b'^-comcs  large.   The  effect  is  still  more  pronounced  v;hen 
d  increases  from  3  to  3.0. 

Figure  3  shoe's  t'lat,  for  each  ratio  r,  as  the 
value  of  d  increases,  the  actual  level  of  significance  (a) 
increases.   E^or  a  particular  value  d,  the  significance 
level  is  seen  to  increase  as  r  increases  from  0  to  1.   This 
effect  is  luoro  pronounced  when  the  number  of  variates  in- 
creases, say,  from  p  =^  2  to  p  =  3. 

Similar  studies  have  been  made  by  Hopkins  and 

Clay,'^^  and  Ito  and  Schull.^    Hopkins  and  Clay  concluded 

2 

that  the  distribution  of  T   for  p  -  2  with  N^  =  N2  5  10  is 

rather  robust  in  respect  to  variance  inequality,  but  that 
this  robustness  does  not  extend  to  unequal  sample  sizes. 
The  results  of  the  Ito  and  Schull    study  indicates  that  for 
p  =  1,  2,  3,  and  4,  with  very  large  samples  of  nearly  equal 

size,  inequality  of  covariance  matrices  does  not  appreci- 

2 

ably  affect  the  level  of  significance  or  the  power  of  T 

test  as  long  as  the  characteristic  roots  of  ^2^1    remain 
within  the  range  (0.5,  2). 

Although  these  last  studies  extenuate  the  concern 
about  the  inequality  of  dispersion  matrices,  they  do  not 
mitigate  the  necessity  of  conducting  the  related  test. 
This  is  so,  because  the  conditions  set  forth  for  the 
ineffectiveness  of  the  heterogeneity  are  very  restrictive 
and  are  less  likely  to  be  met  in  practice. 
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2  6 
Several  approac'r.os  have  oocn  presented    for 

testing  the  equality  of  dispersion  matrices,  but  the  one 

which  is  morn  frequently  cited  is  the  approach  taken  by 

Box.  '   This  approach  tests  the  K,  hypothesis,  i.e., 

H,  :  -,  =  ■•  ^  =  ...  =  ^,  f  bv  an  F  statistic.   To  carrv  out 
112  k    " 

the  test  the  following  Quantifies  are  calculated  first: 


2D+3r)-l,_l     1,      -,       ,,, 


'1  "  6  (k-1)  (p+l 


a   _  (p-1)  (p+2)    ,  1^_  _    ]_ 
"^2  eTk-i)    ^^  ^,2    ^2'  ' 

■'k 


(2) 


where  p  =  nuiTiber  of  variables,  k  =  number  of  qroups,  N,  = 

number  of  observations  in  the  k-th  sample,  and  N  =  IN,  . 

k  ^ 
2 
If  A„  -  A,  >  0,  the  test  statistic,  with  f,  and  f^  degrees 

of  freedom,  is 


M_ 
b  ' 


where  f^  -  ^  (^"l^  P   (p+i) , 


H    =    (V2)/(^-\  )' 


(3) 

(4) 
(5) 


b   =  f,/(l-A,-f,/f_)  , 


(6) 


M   =  Nlnls   i.^MNj^lnlS..3^|), 
k 


(7) 


126 


lV".ierc  S^..j,  is  the  usual  unbiased  estimate  of  the  variance 
or  covariar'.ce  bctv;een  the  i-th  and  j-th  variable  m  the 
k-th  sample,  and 


If  A2-A^^<0,  f^  -  (f^+2)/(A^2-A2),  (9) 


and  the  test  statistic  i: 


^1 


F^   =  f2M/[f^(b-M)]  (10) 

Mathematical  Formul ations  for 
Sample  Use 

Although  the  procedure  of  classification  of  in- 
dividuals into  appropriate  groups  is  more  easily  derived  for 
the  cases  in  which  the  parameters  of  multivariate  normal 
populations  (i.e.f  m.ean  vectors  and  dispersion  matrices)  are 
known,  only  the  cases  in  which  the  parameters  are  not  known 
will  be  considered  here.   Such  a  choice  arises  from  two 
considerations:   (1)  the  purpose  of  this  study  is  not  to 
delve  into  the  mathematics  of  the  MDA  technique; ^^  and  (2) 
in  most  applications  of  the  MDA  theory,  the  parameters  are 
not  known  and  are  inferred  from  samples. 

In  presenting  the  subject,  the  matter  related  to 
the  two-group  case  is  introduced  first,  then  the  k-group 
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case  is  cliscus&od.   In  each  case,    also,  the  situations  of 
equal  and  unequal  dispersion  matrices  are  considered  sepa- 
rately , 

The  Two-Gro up  Case 

Equal  dispcrson  matrices .   This  has  been  by  far  the 
most  prevalent  case  in  research  works,  either  due  to  the 
nature  of  data  or  justified  (and,  at  tir;ies,  unjustified) 
assumptions  made  by  researchers. 

Test  of  significance.   Before  any  attempt  is  made 
for  fitting  a  discriminant  function,  the  following  hypo- 
thesis should  be  tested  to  ascertain  the  equality  of  the 
mean  vectors  of  the  two  populations : 

H2  :  u^  -  ^2  (1^^ 

If  the  hypothesis  is  not  rejected,  it  may  indicate  that  the 
means  of  the  two  populations  are  so  close  to  each  other  and 
the  overlap  of  their  related  frequency  distribution  curves 
is  so  large  that  no  practical  difference  can  be  found  be- 
tween the  two  groups.   In  such  a  case  the  predicting  power 
of  any  function  to  classify  an  individual  into  one  of  these 
groups  will  be  too  weak  to  be  useful.   Thus,  only  if  the 
hypothesis  is  rejected,  continuation  of  the  analysis  makes 
sense . 

Several  test  statistics  have  been  developed  for 

29 

testing  the  H-  hypothesis.  '   To  name  a  few,  Hotelling's 
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2  2 

T  ,  Maha.!  o  nobis '  D',  and  Wilks'  A  are  some  of  such  tests 

which  have  been  sho\vn  to  be  interrelated.     Hotelling's 

2  31 

T   statistic,  in  the  way  cited  by  Anderson,    sets  the 

critical  renion  in  the  following  form: 

2    (N^+N  -2)p     p 
r/    ,    ^t-N^V-—  ■  'N^.N,-p-l(a)         (12) 

with  significant  level  a,  where 


2     ^1^-^         -  -  '   -1   -  - 

where  (X, -X„)  is  the  group  mean- difference  vector  of  p 
variables  and  S  is  the  pooled  within-grcup  variance  matri: 
whose  element  is: 


1     2    \ 


ij  "  n:+n72  I  ,      ^   ,    (\im"'\i)  (\jm  ^j^  ' 
-'         12    K=l  m=l  -'  -^ 


(14) 


where  X,  .   is  the  m- th  observation  (m  -  1,  2,  ...  N,  )  for 
the  i-th  variable  (i,  j  =  1,  2  ...,  p)  in  the  k-th  sample 
(k  -   1,  2)  from  the  k-th  normal  population  with  (u,  and  Z) 
parameters,  and  X,  .  is  the  k-th  group  m.ean  of  the  i-th 
variable . 


The  discriminant  function.   Given  that  H,  has  not, 
but  H„  has  been  rejected,  the  appropriate  discriminant 
function  is  a  linear  one  in  the  following  form: 


p 

Z  =  Y.  B,X.  (15) 


l^x   ^  ^ 


There  are  several  approaches  to  assign  values  to 
B's  coefficients,  based  on  the  information  contained  in 
samples.   Ladd,  however,  has  shown  that  most  of  the 
alternate. ves  are  proportional  to  each  other.  "^^   As  the 
first  approach,  he  derives  B  vector  in  such  a  way  that  the 
between-group  variance  is  maximized  relative  to  wi thin- 
group  variance.   In  other  v/ords,  he  determines  B '  s  to 
maximize  the  following  function: 

G  =  B  dd  B/B  SB,  (16) 

where  d  =  (d^,  d^,  ...,  d  )  =  column  vector  of  group-mean- 
differences  (X^-X^) .  The  solution  to  the  above  function  is 
the  vector  of 

B  =  -2-  s~^d,  (17) 

where  -r-   is  a  constant:  a  is  a  scalar  resulting  from  d  B, 
and  A  is  a  Lagrange  multiplier. 

Criterion  for  classification.   After  the 
discriminant  function  is  constructed  in  one  way  or  the 
other,  with  respect  to  the  B's  various  alternatives,  one 
may  wish  to  use  it  to  classify  one  or  more  observations 
as  belonging  to  one  or  the  other  populatici^.   For  this 
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purpose,  7\ndorson  proposes  tho  follov/mg  clar^si  f  ica  tion 
rule:  ^   Assicjn  the  observation  to  group  1,  if  the  follow- 
ing inequality  holds  and  to  group  2,  if  its  strict  reverse 
is  true: 

x's-1  (X.-X^)  ^-  h   ('V^2^  ^"^  (Xj_+X2)  +  In  L       (18) 


where  L 


q^  C(i  i?)/  q-,C(2!l)  ,  (19) 


and  q-,  and  q^  are  a  fJ^^^^LOi-li  probabilities  of  drawing  an  ob- 
servation from  popuJation  rr,  and  ti„,  respectively,  C(ll2)  is 
the  cost  of  misclassifying  an  observation  from  ii  „  as  from 
Ti ,  ,  and  C(2|l)  an  observatio;^  from  7i ,  as  from  tt-.   The 

left-hand  term  of  the  abovt-,  inequality  is  the  subject 

34 
linear  discriminant  function.  '   Thus,  if  v/e  let 

B  =  S"-'-  (X^-X^)  (20) 

Then,  the  above  inequality  becomes: 

X  B  5  i  (X-j^+X^)  B+ln  L  (21) 

m  the  case  that   q^  -  q   and  0(112)  =  C(2|l),  the  last 
term  drops.   The  rule  then  becomes:   af:sign  the  observation 
to  group  1  if, 

x'b  >  J    (^1+^2^  '^  ^^^^ 

and  to  group  2,  if 
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X  B 


1    (X^.X^)  B 


(23) 


Unequal  disperson  matrices.   So  far,  the  discus- 
sion has  revolved  about  the  tv;o-group  case  with  equal  dis- 
persion matrices.   If  the  matrices  are  not  equal,  however, 
the  previous  discussion  sh^ould  bo  revised  in  the  f ollov.'ing 
v.'ay : 


Test  of  significance.   To  test  H   hypothesis, 
Anderson    suggests  two  T  -tests,  one  for  the  case  when 
N-^  =  N   =  N  and  the  other  for  the  case  when  N,  <  N,  .   V7hen 
^1  '--  ^2  -  N, 


1-2  ^  K  (X^-X^)'  S  -*■  (X^-X2)  , 


(24) 


where  S  - 


^-^      L       (X   -X   -X  +X  )  (X^  -X„  -X,+X„)      (25) 
iM-l-    T     xa       la       L       Z  la.       2.0.       1       2 


This  test  has  T —  distribution  with  N-1  degrees  of  freedom. 
Vvhen  N   <  N- , 


=  N-^  (^1-^2^  ^'^  ^^1"'^2''  '   ^''^^^^ 


(26) 


1        ^  -      Ti  1     -^ 

s  =  ^^^ — Y  •■      f^i  -^1-  kr^  (-9  -x^  '•   ^o^)] 


[X^^-X^-   -1  (X^^-A.   ,    x.e)],  for  a^3   (27) 
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This  tcizt    also  has   T-'--dis-Lributien  with  N, -1  degrees  of 
freedom. 


The  functioii.   In  the  case  of  unequal  dispersion 

ma  trice?,  the  aiscrimin.ant  function  vv'ould  have  the  quadratic 

.     36 
form. 

Z  -  I  Inj^^l  -  1  [(x-v^)' c-1  (x-x^)_(x-X2)'s^-l  (x-x^)]   (23) 


vv-here  S^    and  S^  are  the  sample  dispersion  matrices,  whose 
elements  are: 

1  "i 

1  a  — ±  -^      -^ 


^^'^2^ij    N^-l   ;^-L  ^^2ia""^2i^^^2ja"^2j)         (30) 


The  quadratic  function  contains  the  linear  terms  and  the 
quadratic  tcrm.s .  For  a  three-variable  case,  the  previous 
general  function  (28)  results  in  the  following  form: 


=  ^10^1  ■*•  "20^2  +  '-^30^3  +  "11^1^  "-    "21^1^2 


4-  a^^xl    +  ^3^X3^X3  +  0.32X^X3  +  a33X^,        (31) 


Which  can  be  cast  into  the  following  form,  tor  a  m.nemonic 
purpose:  . 


I, \  near  Termo     Var.  1  Var.  2     Var.  3 

Var.  1  aj^g  «j  ^^ 

Var.  2  "20  '21         °22 

Var.  3  "30  '^31         "32       ^^33 

Critcrioti  for  classification.   The  classification 
rule  is  ODtained  directly  from  the  discriminant  function. 
Let 

X^  =  O'-'^-j)  '  S^"^  (X-Xj^)  (32) 

Equation  (28)  then  becomes: 


1  ._.^2|   1 


Z  =  i  Inj^-i-i(x2-v2)  (34) 

2    'Sj    2    1   2 


And  the  rule  for  classification  is:   Assign  the  observation 
to  group  1,  if 


x2  <  x'  +  lnl^-1  -  21nL,  (35) 

1     2      ^i 


and  to  group  2,  otherwise. L  is  the  same  as  defined  in  (19). 
Again,  if  the  a  priori  probabilities  of  the  tv/o  populations 
and  their  costs  of  misclassif ication  are  equal,  the  last 
term  on  the  right-hand  side  of  the  above  inequality  drops 
out. 
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groups  exceeds  two,  the  elocjar.ce  that  the  Fisher's  solution 
enjoys  in  the  two-group  case  disappears.   The  procedure  of 
discriminating  among  several  populations  is,  however,  similar 
to  that  of  the  two-population  case.   As  before,  the  sample 
space  is  divided  by  discriminant  functions  into  k  mutually 
exclusive  regions.   In  this  case,  the  number  of  discriminant 
functions  is  more  than  one.   If  the  number  of  variables,  p, 
is  greater  than  the  number  of  popu]  ations  minus  one,  k--l, 

the  number  of  discriminant  functions  required  would  be  k-1 . 

■57 
Otherv/ise,  it  would  be  p. 

Test  of  significance.   The  following  H^  hypothesis 
may  be  tested  by  two  methods.   The  first  one  is  the 
Bartlett   V,   which  is  the  first  approximation.   The  second 

one  is  that  suggested  by  Rao,  which  is  a  better  approxima- 

38 
tion. 

"2  •  ^1  "  ^2  "  •  •  •  "  ^'k  ^^^^ 

The  Bartlett   V   statistic  is 

V  =  -n.  InA  =  x^  (j,_i)  (37) 

where    m  =  N-l-(p+k) /2 ,  (38) 

and      '^  =  |wJ/|t|  (39) 


An  element  of  W  and  an  element  of  T  are 


K    "k 


-'    k=l   n^l 


K    \ 


(T)  .  .  =   Z     I       (X,  .  -A.  )   (X,  .  -X.)  ,  (41) 

^:j    k=l   n=l    ^^"^   ^     ^^^   ^ 


The  chi-square  in  (37)  has  p(k-l)  degrees  of  freedom.   The 
Rac  test  is: 


ras+2A     y      2r 


where 


(43) 


3  =      (P^(k-l)l-4„)2  (44) 

p^+(k-l)'^-5 


X  =  [p(k-l)-2]/4,  and  (45) 

r  =  p(k-l)/2  (46) 

The  chi-square  test  may  also  be  used  to  test  the 
significance  of  the  j-th  discriminating  function,  given 
that  the  previous  (j-1)  functions  are  significant.   For  this 
purpose,  equation  (37)  can  be  written  in  the  following  form: 

r 
V=  [N-l-(p+k)/2]  L      In  (1  +  X.)  =  x'  P(k-1)  ,     (47) 
j  =  l         ^ 


13  G 


where    the  rolatirm  between  A  and  A.:  is  as  the  following 

(>^'s   are  ordered  such  that  \,  >  \^    >    ...  >  a  }  : 
J  1     2  r 

I W I    ^  1 

A=i-L-,::    j^  (48) 

|Ti   D=l      ^j 

The  successive  terms  In  (1+A.)  are  statistically  independent 
Therefore,  each  of  the  additive  components  of  V  is  distribu- 
ted approxi^aately  as  chi-square  with  p+k-2j  degrees  of 

39 
freedom.     That  is. 


V.  -  [N-l-(p+k)/2]  In  (l+\j)  =  X'-p+j,_2j        (49) 


By  cumulatively  subtracting  V,,  V- ,  ...,  V   from  V,  we  get, 
each  time,  a  remainder  which  has  also  a  chi-square  distribu- 
tion with  p(k-l)  -  (p-i-k-2  j)  degrees  of  freedom  and  is  an 
appropriate  statistic  for  testing  the  significance  of  the 
residual  discrimination.   If  the  residual  is  significant, 
given  that  the  first  j-1  functions  are  also  significant;  it 
would  indicate  that  the  j-th  function  is  significant.   If 
the  residual  is  not  significant  at  the  specified  level,  it 
may  be  an  indication  that  only  the  first  j-1  functions  have 
discriminatory  powers  and  the  remaining  functions  may  be 
disregarded.   This  test,  of  course,  is  not  an  exact  one 
and  should  be  applied  cautiously. 
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The  functions.   Under  the  a:-5u.^.ption  that  cj  ^  and 

q  .,  a  priori  probabilities  of  drav/j  ng  an  observation  from 

populations  -  .  v;ith  donsitv  P.(X)  and  ■:  .    with  density  P  .  (X)  , 

respectively,  are  knov.'n,  the  regions  of  classification  R^  and 

R   are  defined  by  a  set  of  discrirainatory  fractions  v.-hich 

3 
minimize  the  expected  cost  of  mi.sclassi  f  ication.   These 

functions  are  linear  and  are  constructed  by  the  likelihood 
40 


ratio; 


a  .(X)  -  log  1^1-=  [x-l   (x.^x.)]  s"^  (x.-x.) ,  (50) 


where   S    is   defined   by 

V  K  i 

(    E     N.-k)S  =      S  T.         (X.    -X.)     (X.    -X)'  (51) 

\    ,      1  .    T  ^  la      1  la      1 

1=1  1=1       Ci=l 

The  number  of  such  discriminating  functions  is  either  equal 
to  k(k-l),  one  for  every  pair  of  populations,  or  can  be 
reduced  to  k-1  two-class  problems,  v;here  the  i-th  problem 

is  solved  by  a  linear  discriminant  function  that  separates 

.  ^      41 
points  assigned  to  tt  .  from  those  not  assigned  to  n^. 

Criterion  for  classification.   Under  the  above 

42 

assumptions,  the  classification  rule  is  as  the  follcwmg: 

Assign  to  group  i,  if  for  all  j's 

X?  5  X?  +  2  In  (  q •/  q  •)  ,  for  i,  j  =  l,  2,  ...,  k   (52) 
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where  x|  =    (X-X^)   S  "'"  (X-X^),  and  (53) 

X?  =  (X-X  )   S"^  (X-XJ  (54) 

4  3 
The  rule  may  also  be  stated  in  the  follov;ir.g  forra: 

Assign  to  group  i,  if 

U..(X)  >  In  (q_j/q^),  for  i  ,  j  -  1 ,  2  ,  .  .  .  ,  k  ;  i?^  j   (55) 

Since  U.  .  =  — -  U.  .,  there  is  no  need  to  calculate 
ID     ID 

more  than  k  (k-l)/2  coefficient  vectors  of  the  functions. 

In  fact,  soma  of  the  coefficients  can  be  calculated  from 

44 
others    and  the  nUiiiber  of  independent  functions  is  actually 

k-1. 

Unequal  dispersion  iriatrices.   The  previous  discus- 
sion embraced  tests  and  classification  rules  pertaining  to 
the  k-group  case  under  the  assumption  that  the  dispersion 
matrices  of  the  groups  are  equal.   If  such  an  assumption 
does  not  correctly  represent  a  real  situation,  the  above 
tests  and  rules  do  not  hold  any  longer.   The  appropriate 
treatment  for  the  case  of  unequal  dispersion  matrices  is  as 
the  following: 


Test  of  significance.   Anderson  suggests  a  test 

statistic  to  test  the  hypothesis 

k 

H  :   r,    3^.  Vi.  =  u,  (56) 

i  =  l 
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v.'here  B^ ,  B^ ,  ...,  B   are  given  sci^lars,  u-  is  the  mean 

45 
vector  of  the  i-th  population,  and  u    is  a  given  vector. 

However,  the  desired  hypothesis  to  be  tested  is 

Sisenbeis  and  Avery  shov;  that  H„  hypothesis  can  be  trans- 
formed to  a  form  which  is  suitable  for  testing  by  the 

46 
Anderson's  test.     The  proposed  test  is: 

T2  ^  N{Y-m)  's"^  (Y-m)  (58) 

where  S  and  Y  are  defined  by 

(N,-1)S  =   E   (Y  -Y)   (Y  -Y)  (59) 

a  =  l 

k        N,l       ,   ^^1  1    ^i 

Y  =  B,X,   +  Z        B  .  (--)  (X.  -  r^  L   X.  3  +  .       I       X.  )  ,   (60) 
a    1  la    .  ^    1  'iM.    la   N,  „  ,   i     „  ^,      ^   JY 
J.  ==2         1         1  g-^1      JN.N'   Y=1 

_         ^1         k 

Y  =  j^    Y.       Y      =       I         B.X.  (61) 

1   a=l        1=1 

N. 

^i  -  w:    /,  ^iB  (^2) 

X 
ia  is  an  observation  vector  from  population  i  and  Ni  is  its 

number  of  observation  with  N,  being  the  smallest  one.   When 

H  hypothesis  is  true  the  statistic  has  the  T^- distribution 

for  dimension  p,  with  N,-l  degrees  of  freedom. 
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Tho  func-cjon.   When  the  dispersion  matrices  of  the 
subject  groups  are  not  equal,  the  discriminant  functions 
would  be  in  quardratic  f  orr-s .   The  procedure  of  constructing 
such  functions,  hov.'ever,  follows  that  of  the  linear  case. 
And,  utiliz-ation  of  the  likelihood  ratio  yields  k(k-l) 
functions  cf  the  following  form: 


U   (X)  =  I-   Inlgij  -|[(X-X  )'s  '■'■  (X--X  )  -  fX-X.)'s:^  (X-X.)] 

(63) 
f  or  1 ,  j  -  1 ,  2 ,  .  .  .  ,  k ;  i  7^  j 

where  S^  and  S .  are  the  sample  dispersion  matrices  of  group 
g  and  h,  respectively.   An  element  of  such  matric  is: 

'^i'sh^S:^   -,  <^iga-V  «ih.„-V-*°'^i=l'  2 '^      (64) 

1    a— 1 


The  difference  between  such  function  and  its 

linear  counterpart  hinges  upon  the  gradratic  terms  of  x's  "■'■X 

1 

and  X  S  .   X  which  result  from  the  expansion  of  (X-X.)  S  ."■"■ 
(X-X^)  and  (X-XJS."  (X-X .),  respectively .   In  the  case  of 
equal  dispersion,  the  quadratic  terms  are  cancelled,  leav- 
ing only  the  linear  terms.   This  is  not  the  case,  however, 
when  the  dispersion  matrices  are  not  equal.   The  related 
functions,  therefore,  contain  both  linear  and  quadratic 
term.s . 
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Criterion  for  clasi^if ication.   When  the  a  priori 
probabilities  of  ir.e-r.borship  in  populations  are  known  the 
classification  rule  is  as  the  following: 
Assign  to  group  i ,    if 


2  S .  q . 

^i  .<  x^  -  In  1^-^-1  +  2  In——,  for  i,  j  =  1,  ..  .,k    (61 


where  x''  and  x"^  are  defined  by 
1       D 


X?  =  (X-X^)'  S^  ^  (X-X^)  (66) 

X?  -  (X-Xj)'  S^~^  (X-Xj)  (67) 

The  rule  ir.ay  also  be  stated  in  terms  of  the  functions: 
Assign  to  group  i,  if 


U^j  (X)  >  In  (  q^/q,)  (68) 

Relatio  n  to  Regrerj  si  on  Analy  sis 

Researchers  are  usually  rcore  familiar  with  regres- 
sion analysis  than  with  multivariate  discriminant  analysis. 
In  general,  hov;ever,  both  approaches  are  characterized  by 

several  similar  features.   In  his  1966  paper,    Ladd 

49 
compared  linear  probability  functions    and  discriminant 

functions,  and  concluded  that  they  "start  from  quite  dif- 
ferent places,  follow  different  routes,  and  end  up  at 

50 

nearly  the  same  place." 
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Fisher  shows  that  by  perforrr.ing  a  regression 
analysis  in  which  the  dependent  variable  of  the  first  group 
equals  N^/CN^+N^)  and  that  of  the  second  group  equals' 
-Nj^/CN^+N^)  ,  the  regression  coefficients  are  proportional 
to  those  of  the  discriir.inant  function.^''' 

Such  a  regression  analysis  may  be  written  in  the 
following  form,  as  set  forth  by  Kendall :^^ 

Source       Sum  of  Squares  Degrees  o£  Freedom 

Regression    [N  N„/ (N^ +N_ )  ]  E  p. .  (X,.-X^.)  p 

1  -^    12      X   li   2i  ^ 

Residual      iN^N^/ (Nj^+N2)  ]  [1-S  £^  (X^^-X^^^)  ]    N^+N2~p-1 
Total         N^N^/CN^+N^)  ^1+^2-1 

where  1^  is  the  i-th  coefficient  of  the  regression  line. 

5  3  2 

It  has  been  shown    that  the  R  ,  the  multiple  correlation 

coefficient  of  the  regression,  calculated  from  the  above 
table,  has  the  following  relation  with  D   of  the  correspond- 
ing discriminant  function: 

2     r2     (N^+N  ) (N  +N  -2) 
1-R  "l^^2 

The  analysis  of  such  regression  yields  the  same  F  as  does 

2 

the  D   analysis. 

Cramer  generalized  the  conclusion  obtained  by 
Fisher,  as  stated  above,  and  concluded  that  the  same  rela- 
tion holds  for  any  two  distinct  values  of  the  dependent 

^4 
variable . " 
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The  Reliability  Tests 

As  was  discussed  before,  one  of  the  purposes  of 
constructing  an  MDA  model  is  its  application  to  assigning 
individual  observations,  drawn  randomly  from  two  or  more 
populations,  to  their  related  groups.   If  the  assignmients 
are  raade  correctly  for  most  of  the  times,  the  model  is  said 
to  be  accurate.   Otherwise,  it  does  not  errtbody  sufficient 
discriminatory  power  to  be  of  any  value.   There  are  several 
methods  to  test  the  reliability  of  the  computed  model.   Four 
of  these  methods  which  are  most  often  used  are  the  follov;ing 
(1)  The  Original  Sample  Method,  (2)  The  Holdout  Sam.ple 
Method,  (3)  The  Lachenbruch  one-at-a-timie  Holdout  Method, 
and  (4)  The  Scraifibled  Sample  Method. 

The  Original  Sample  Method.   To  determine  the  accu- 
racy of  the  model,  it  has  been  a  usual  practice  to  set  up  a 

so-called  confusion  matrix,  which  v;as  first  introduced  by 

55 
C.A.B.  Smith.     But,  as  noted  by  Frank,  Massy,  and 

Morrison,    this  method  underestimates  the  probability  of 
misclassification.   The  bias  is  due  to  the  fact  that  the 
observations  v/hich  are  used  in  constructing  the  classifica- 
tion matrix  are  the  sam.e  observations  which  have  been  used 
to  build  up  the  discriminant  model. 

The  Holdout  Sample  Method.   An  alternative  m.ethod 
which  has  been  devised  to  eliminate  the  above  bias  is  the 
holdout  method.   In  this  method  the  sample  is  first  divided 
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into  tvv-o  parts.   The  first  part  is  used  to  derive  the 
discriminant  function,  and  the  second  part  is  then  classi- 
fied by  moans  of  the  function  derived  from  the  first  part. 
The  proportion  of  observations  in  the  second  part  which  have 
been  misclassif ied  constitutes  the  probability  of  misclassi- 
fication.   As  noted  by  Lachenbruch  and  Mickey,  ^"^  the  main 
attraction  of  this  method  is  the  ease  with  which  the  error 
rates  can  be  estim.ated  along  with  the  independence  of  the 
distribution  of  the  estimates  upon  the  form  of  the  population, 
Its  drawbacks,  however,  are  its  requirement  for  a  large 
sample  v/hich  may  not  be  available  in  many  cases,  and  its 
uneconomical  use  of  data. 

The  Lachenbruch  one-at-a-ti ma  Koldout  Method.   In 

an  attempt  to  remove  these  deficiencies,  Lachenbruch  pro- 

58 
posed  another  alternative.     His  suggested  method  holds  one 

observation  at  a  time,  computes  a  discriminant  function  with 
the  remainder  observations,  classifies  the  omitted  observa- 
tion with  the  computed  function,  records  tae  result,  if  the 
observation  is  misclassif ied,  and  repeats  the  procedure  for 
all  the  individual  observations.   The  number  of  misclassif ied 
observations  obtained  in  this  way  is  then  used  to  comnute  the 
error  estimates.   Lachenbruch  suggests ■ that  these  estimates 
are  unbiased  for  the  probabilities  of  misclassif ication  for 
a  discriminant  function. 

The  Scrambled  Sample  Method.   VThen  the  available 
sample  is  too  small  to  be  split  into  two  parts--one  for  the 
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analysis  and  another  for  tho  validation,  the  scranblod 
sample  iricthod  riay  be  used  for  evaluatinrj  the  validity  of  the 
discriminant  model  constructed  from  the  entire  available 
sample . 

This  method,  which  has  been  suggested  by  Frank, 

59 
Massy,  and  Morrison,    employs  the  original  sample  and  re- 
assigns the  individual  observations  to  populations  at  random. 
Then  the  discriminant  coefficients  for  this  scrambled  sample 
are  computed,  and  the  discrim.inating  power  of  this  model  is 
estimated  from  its  associated  confusion  matrix  (classifica- 
tion table),  as  for  the  original  sample.   Since  the  indivi- 
duals were  assigned  to  the  populations  randomly,  the  dis- 
criminating power  of  this  model  should  be  zero.   The  devia- 
tion of  its  power  from  zero  would  be  due  to  the  sample 
bias    of  the  original  sample. 

To  test  the  significance  of  the  difference, 
binomial  distribution  theory  may  be  used  in  the  two- 
population  case  for  the  following  t  test: 


t  -  (Q-P)/ap,  (70) 


where  Q  is  the  proportion  of  sample  observations  correctly 
classified  by  the  discriminant  analysis,  P  is  the  correct 
proportion  one  expects  by  chance,  and  a;.  =  P(l-P)/N. 

Finally,  the  sample  bias  observed  in  this  way  can 

then  be  taken  into  account  in  evaluating  the  discriminatory 

6  2 
power  of  the  original  model." 
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The  Rclat i v o  I n por'.:ancc  of  Variablos 

In  constructing  a  discriminant  function  the  re- 
searcher wishes  to  include  those  variables  v/hich  ha^-'e  the 
greatest  discriminatory  power  and  exclude  those  having  the 
sm.allest.   If  there  were  no  correlation  betv;een  the  vari- 
ables,  -  ran^-'.ing  of  the  variables  could  be  made  by  means  of 
either  the  univariate  F-test,  vv^hich  is  based  on  the 
previously-discussed  V7ilks'  la;Tibda,  or  the  weighted  dis- 
crim.inant  function  coefficients.     The  more  general  cases, 
however,  are  those  in  which  the  variables  are  not  indepen- 
dent of  each  other.   Unfortunately,  for  such  cases,  there  is 
no  test  avai,lab]e  yet  which  can  determine  the  discriminatory 
power  of  a  specific  variable.   It  is,  however,  possible  to 
evaluate  the  relative  importance  of  the  variable  subsets. 

The  multivariate  generalization  of  the  univariate 
VJilks'  lambda  provides  a  means  to  rank  all  the  variable  sub- 
sets having  the  same  size.   For  this  purpose,  an  F-test  is 
formed  on  the  basis  of  the  corresponding  Wilks'  lambda.   It 
is  then  used  in  one  or  m.ore  of  the  following  selecting 
procedures . 

There  are  four  stepwise  procedures  for  evaluating 
the  relative  importance  of  various  variable  subsets .   They 
are:   (1)  Conditional  Deletion  Selection,  (2)  Forward 
Selection,  (3)  Backward  Selection,  and  (4)  The  Complete 
Selection.     But  none  of  these  procedures  will  necessarily 
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yield  the  sarr.o  ranking  for  ojch  variable.   The  difference  in 
the  ranking  is  due  co  the  correlation  betv.'een  variables 
coupled  with  the  way  in  which  variables  are  successively 
added  to  or  deleted  frora  the  starting  subsets  in  different 
procedures.   VJith  the  exception  of  the  last  procedure,   v/hich 
searches  among  all  combinations  of  a  subset  with  a  given  size 
to  find  the  "best"  one,  other  procedures  do  not  give  a 
chance  to  a  variable  to  re-enter  into  any  subsequent  sub- 
set, once  it  has  been  dropped  from  a  current  subset.   Since 
inclusion  of  a  variable  with  negative  correlation  is  helpful 
in  increasing  the  relative  importance  of  a  residual  subset 
and  exclusion  of  such  variable  deducts  from  its  importance, 
and  the  opposite  is  true  for  the  variables  v;ith  positive 
correlations,  changing  the  order  of  addition  and  deletion 
of  the  variables  changes  the  ranking  of  a  variable  from  one 
method  to  another. 

At  present,  the  preference  of  one  procedure  over 
another  is  largely  a  subjective  judgment  and  depends  on 
several  factors.   Some  of  these  factors  are  the  desired 
level  of  trade-off  between  accuracy  and  costs,  the  goal 
of  the  research,  and  the  feasibility  of  employing  a  method 
under  a  set  of  constraints.   Whereas  the  univariate  F-test 
of  individual  variability  is  the  crudest  method,  for 
exam.ple,  it  needs  much  less  computer  time  than  the  complete 
selection  method.   While  one  procedure  may  yield  a  better 
result  in  maximizing  the  differences  between  group  means. 
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another   prccedurc  may  give  a  better  classification  result. 
And,  finally,  the  total  number  of  variables  may  be  a 
determinincj  factor  in  employing  the  complete  selection 
method  for  selecting  the  best  subset  with  optimal  size. 

It  should  be  noted  that  all  the  selection  pro- 
cedures are  based  on  the  assumptions  that:   (1)  the  data 
arise  from  r.iultivariate  normal  populations,  and  (2)  the 
associated  dispersion  matrices  are  equal .   When  these  con- 
ditions are  not  met,  the  stepv7ise  selection  procedures  are 
only  approximate  methods . 

In  the  selection  procedures  discussed  above,  the 
importance  of  variable  subsets  is  compared  only  for  a 
given  size,  in  each  step.   To  compare  subsets  of  different 
size,  we  need  to  have  another  procedure.   Unfortunately, 
only  a  special  kind  of  such  procedure  is  available  which 
has  been  given  by  Rao.   In  fact,  his  algorithm  is  a  test 
of  the  hypothesis  of  sufficiency  of  subset  of  variables 
(X^,  X^,     ...,  X  )  or,  equivalently ,  the  hypothesis  of  the 
absence  of  additional  information  of  the  rem.ainlng  subset 
(X   -J,  ...,  X  ).   Following  is  his  suggested  F-statistic. 

P  <3         i+cd| 

where  C  =  N^N^/ (N^+M^) (N^+N2-2) ,  D^  is  the  Mahalanobis  D^ 

2 

statistic  on  the  entire  variables  set,  and  D   is  that  on 

the  selected  subset. 
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Computer  Prograr.3 

There  are  severeil  MDA  computer  prograip.s  available 
either  in  a  listing  or  in  a  package  form.   Each  of  these 
programs,  however,  has  its  ov^n.  capability  and  limitations. 
Therefore,  to  raake  an  efficient  use  of  them,  their 
characteristics  should  be  knov/n. 

Cooley  and  Lohnes'  Program.   Perhaps  one  of  the 
earliest  MD7\  computer  programs  is  that  of  Cooley  and  Lohnes 
which  v;as  published  in  196  2.   It  is  a  part  of  their  book  on 
multivariate  procedures."    This  program  computes  multiple- 
discriminate  functions  for  up  to  50  variables  and  any  number 
of  groups.   Wilks'  lambda  test  of  the  significance  of  dis- 
crimination and  scaled  vectors  to  show  the  relative  contri- 
butions of  the  variables  to  each  function  are  also  computed. 
Their  classification  program,  v;hich  is  separate,  computes 
discriminate  scores  and  probability  of  group  membership  for 
up  to  20  groups,  20  functions,  and  50  variables, 

BMD04M.      This  program  com.putes  a  linear  function 
for  two  groups  with  sample  size  of  up  to  300  in  each  group 
and  a  number  of  variables  of  up  to  25.   Its  output  includes 

mean  scores,  dispersion  matrix  and  its  inverse,  function 

2 

coefficients,  D   statistic,  z-values  of  the  observations 

and  their  group  mean  and  variance. 

BMD05M.      This  program  computes  a  set  of  linear 
functions  for  up  to  five  groups  with  sample  size  of  up  to 
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175  in  each  group  and  a  number  of  variables  of  up  to  25.   It 
performs  the  group  assignracnt,  and  its  procedure  is  based 
only  on  the  esrimatod  probability  density.   It  also  evaluates 
the  coniputed  functions.   Tne  output  of  this  program  is  simi- 
lar to  that  of  the  previous  one  with  additional  information 

on  classification  and  evaluation. 

70 
BMD07M.      This  program  also  computes  a  set  of 

linear  functions,  but  its  main  features  are  the  stepwise 

procedure  and  plotting  of  the  first  two  canonical  variables. 

The  number  of  variables  and  that  of  groups  should  not  exceed 

80.   Its  classification  procedure  uses  prior  probabilities. 

The  output  includes  most  of  those  mentioned  in  the  previous 

progreiTu,  with  additional  information  for  each  step. 

71 

SAS  DISCRIM  Procedure .     This  program  performs 

discriminant  analysis  based  either  on  the  individual  within 
group  covariance  matrices  or  on  the  pooled  covariance 
matrix.   The  choice  is  either  optional  or  according  to  the 
result  of  the  homogeneity  test.   The  procedure  is  direct, 
i.e.,  it  does  not  use  stepwise  method.   However,  it  uses 
the  prior  probabilities  in  classification.   The  output 
includes  values  of  the  classification  variables,  fre- 
quencies, and  prior  probabilities,  simple  descriptive 
statistics  for  each  group,  and  some  other  optional  informa- 
tion.  It  also  prints  classification  results  for  each 
observation. 
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SPSS  DTFCRTMINANT  Progra:^.''^   This  program  performs 
discriniinant  analysis  either  directly  or  through  stepwise 
methods.   Its  classification  procedure  nay  be  optionally 
based  on  the  separate  group  covariance  matrices,  rather  than 
the  pooled  matrix.   In  addition  to  the  descriptive  statis- 
tics, it  optionally  computes  a  test  for  the  equality  of  the 
covariance  matrices.   Additional  options  control  several 
other  features.   These  include  handling  of  missing  values, 
plotting  and  punching  discriminant  scores,  printing  the 
classification  function,  printing  and  punching  the  classi- 
fication results,  and  punching  of  various  matrices. 

7  "^ 
Eisenbeis  and  Avery's  MULDiS  Program.     This  is 

perhaps  the  most  comprehensive  program  on  MDA ,  so  far.   The 
listing  of  the  output  items,  of  itself,  requires  3-4  pages. 
The  available  options  are  also  of  mtany  varieties.   These 
include  features  such  as  linear  and  quadratic  approaches, 
complete,  forv;ard,  and  backward  stepv;ise  selection  pro- 
cedures, and  Lachenbruch  holdout  classification  method. 
The  contribution  of  the  authors  comes  not  only  from  their 
program,  but  also  from  their  excellent  book  which  has  been 
devoted  entirely  to  the  subject  matter. 

Application  of  MDA  in  Financial  Area 

In  contrast  to  the  early  applications  of  the  MDA 
in  the  areas  of  anthropology  and  plant  taxonomy  which  was 
around  1920  and  1935,  respectively,  and  with  the  exception 
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of  the  pionooring  work  of  David  Durand    in  1941  on 
evaluating  the  credit  v;orthiness  of  loan  applicants  by 
utilizing  MDA  technique,  its  use  in  the  financial  area 
caine  as  late  as  1959,  when  James  Walter  attempted  to  clas- 
sify  firms  into  high  or  low  price-earnings  ratio  groupings. 

The  v/ork  of  iMyers  and  Forgy  on  development  of  nu- 

7  fi 
m.erical  credit  evaluation  systems,    that  of  Paul  Smith  on 

77 

measuring  risk  in  installment  credit,     and  that  of  Keith 

7  P 
Smith  on  classification  of  investment  securities    appeared 

in  the  next  fev,'  years.   These  works  are  briefly  discussed  in 

the  thesis  of  Altman,  v'ho  m^ade  an  extensive  and  successful 

use  0£  MDA  m  predicting  corporate  bankruptcy,    in  1967. 

Altman 's  study  has  been  influential  and  rightly  deserves 

special  mention. 

The  utilization  of  MDA  in  the  financial  area  has 

been  intensified  during  the  last  few  years.   A  brief  review 

of  published,  and  a  few  unpublished,  studies  follows: 

SO 
Carleton  and  Lerner    sought  to  develop  a  scoring 

system  for  municipal  bonds  which  could  duplicate  the  ratings 
of  Moody's.   For  this  purpose,  they  first  considered  five- 
group  ratings.   Then,  they  experimented  v/i  th  a  two-group 
rating.   Ey  using  a  six-variable  profile  they  obtained 
linear  discriminant  functions  for  each  case.   But  their 
predictive  efforts  produced  mixed  results.   The  analysis, 
nevertheless,  provided  insights  into  the  structure  of  bond 
ratings . 
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8  1 
In  another  study,  Ronv/ick    employed  a  linear 

discriminant  analysis  to  verify  that  investment  portfolio 
behavior  can  be  characterized  and  classified  using  a  com- 
bination of  four  interrelated  variables.  The  function  he 
finally  computed  contained  only  three  of  the  four  variables, 
and  was  successful  in  discriminating  between  high  perfor- 
mance portfolios  and  low  performance  portfolios. 

8  "^ 

Haslem  and  Longbrake  "  utilized  a  stepwise  dis- 
criminant analysis  to  find  a  set  of  variables  which  can 
explain  differences  in  bank  profitability.   In  the  final 
function,  they  used  eight  variables  among  which  the  ones 
related  to  operating  expenses  weie  particularly  important. 
In  developing  the  model,  they  neither  tested  for  the 
equality  of  dispersion  matrices  nor  used  a  holdout  sample 
method.   However,  they  tested  the  relevance  of  the  vari- 
able set  for  all  sample  banks  by  the  m^ultiple  regression 
analysis . 

By  using  a  linear  discriminant  function  Williams 

83 

and  Goodman    concluded  that  financial  variables  do  tend 

to  distinguish  between  the  various  industrial  classifica- 
tions.  According  to  their  report,  a  set  of  11  selected 
variables  differentiates  between  utilities  and  industrials. 
Five  selected  industrial  classes,  i.e.,  chemical,  drugs, 
domestic  oils,  steel,  and  electronics,  also  appear  to  be 
quite  distinct  by  the  help  of  14  variables. 
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By  empJoymg  a  stepwise  linear  discriminant  pro- 
cedure and  by  using  a  sraa  11  sample  size  of  42  firms, 
Edmister    b.iilt  a  model  v;hich  was  highly  accurate  in  pre- 
dicting small  business  failure  in  the  original  sam.ple.   For 
the  validation  purpose,  however,  he  used  the  simulated 
sample  approach. 

The  capital  adequacy  of  commercial  banks  has  also 

been  a  subject  of  a  discriminant  analysis.   The  linear 

85 

model  developed  by  Dince  and  Fortson    for  this  purpose 

contained  five  variables,  and  its  validity  was  tested  by 
an  independent  sample  of  size  38,  which  resulted  in  only 
about  8  percent  misclassif icati on. 

Deaken    first  replicated  Beaver  s  study''  with  a 
new  sample  based  on  a  univariate  analysis,  and  then  uti- 
Id.-^ed  Beaver's  14  financial  ratios  in  constructing  a  linear 
discriminant  function.   He  concluded  that  MDA  can  be  used 
to  predict  business  failures  as  far  as  three  years  in 

advance  with  a  fairly  high  accuracy. 

8  8 
Lusk    explained  by  an  example  that  the  linear 

discriminant  analysis  can  be  effectively  utilized  for 

screening  divisional  investment  proposals  based  on  a  set 

of  characteristics  which  can  be  found  in  the  firm's  two 

ex-post  accepted  and  rejected  investment  populations. 

89 
McCall  and  Eisenbeis    distinguished  between  a 

group  of  comjTtercial  banks  which  have,  and  a  group  of 

commercial  banks  which  does  not  have,  affiliation  with 
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Raving  hanks.   Thoir  analysis  indicated  that  an  affiliated 
bank  tended  to  have  a  lov;er  participation  rate  in  both  the 
time  and  savings  deposit  and  residential  real  estate  loan 
markets  than  an  unaffiliated  bank.   The  result  of  their 
quadratic  discrirainant  function  showed  that  the  two  groups 

arc  distinct  and  overlap  very  little. 

90 

Pinches  and  Mingo    attempted  to  develop  and  test 

a  model  for  predicting  industrial  bend  ratings.   Their  final 
model,  consisting  of  four  linear  functions,  incorporated 
six  variables  and  correctly  predicted  approximately  6  0  per- 
cent of  the  ratings  for  a  holdout  sample  and  another  sample 
of  newly  rated  bonds. 

The  objective  of  the  study  conducted  by  Bates^"'- 
was  to  estimate  a  discrim.inant  function  which  is  more  ef- 
fective in  identifying  successful  loan  application  from 
urban  black  enterpreneurs .   He  estimated  tv.'o  quadratic 
discriminant  functions  -  one  for  black  and  one  for  white 
businessmen.   His  results  suggest  that  the  discriminant 
function  which  is  appropriate  for  judging  white  borrowers 

does  not  effectively  classify  black  borrowers. 

92 

Gilbert  em.ployed  a  quadratic  discriminant  func- 
tion to  distinguish  between  decisions  of  banks  to  open  and 
not  to  open  de  novo  branch  offices  in  the  relevant  markets 
within  a  three-year  period  following  federal  regulatory 
action  on  their  merger  applications.  His  model  classified 
about  6  9  percent  of  his  holdout  sample  correctly. 


156 


In  soarching  for  factory  v;hich  make  the  holder  of 

a  bank  charge-card  an  active  or  inactive  user,  Awh  and 

93 
Weters    employed  a  linear  discriminant  function.   They  re- 

pori:ed  that  their  discriminant  function  containing  seven 

variables  correctly  classified  78  percent  of  the  cardholders 

However,  they  do  not  specify  that  this  percentage  has  come 

from  the  original  sam.ple  or  from  a  secondary  or  holdout 

sample.   There  is  also  nc  mention  about  the  homogeneity 

test  of  dispersion  matrices  of  the  two  groups. 

94 

Trieschmann  and  Pinches    conducted  a  study  to 

develop  a  model  that  will  be  able  to  statistically  dis- 
criminate between  distressed  and  non-distressed  property- 
liability  insurance  firms.   Their  model  consisted  of  a 
linear  discriminant  function  with  six  financial  variables. 
It  could  correctly  classify  94  percent  of  the  original 
sample.   The  validity  of  the  model  v.'as  checked  by  the 
simulated  sample  aipproach. 

In  evaluating  the  social  (versus  private)  value  of 
regular  invcstmient  advice,  Logue,  Radcliffe,  and  Tuttle^^ 
undertook  an  empirical  i.nvestigation  to  determine  whether 
analysts'  recomiaendations  are  more  closely  associated  with 
future  factors  than  with  past  factors,  .and  to  identify 
factors  which  impact  m.ost  significantly  in  a  recommendation. 
The  results  obtained  from  numerous  stepwise  linear  dis- 
criminant functions  distinguishing  among  various  investment 
recommendation  categories  suggested  that  explanatory  power 
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is  greater  usItkj  the  past  data  than  future  data.  The  power 
is  even  greater  when  both  selected  past  and  future  variable; 
are  included. 
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CHAPTER  IV 

THE  EMPIRICAL  TEST  OF  THE  HYPOTHESIS 
P^m    PRESENTATION  OF  RESULTS 

Intro du ct  ion 
In  Chapter  I,  it  was  stated  that  one  of  the  main 

purposes  of  this  study  is  to  j.dentify  economic  and  non- 
economic  factors  which  might  serve  to  differentiate  between 
financially  distressed  securities  brokerage/dealer  firmis  and 
those  which  appear  to  be  economically  viable.   To  ensure 
this  purpose,  an  analytical  framev^ork  v/as  set  forth  in 
Chapter  II  where  it  vv^as  liypothesizeJ  that  the  probability 
of  financial  distress  of  a  firm  at  time  t  is  a  function  of 
the  net  cash-flow  per  dollar  of  assets  at  time  t,  the  extent 
of  the  risk  assumed  by  and  imposed  upon  the  firm,  and  the 
mismanagem.ent  of  the  firm  (equation  1  in  Chapter  II)  .   It 
was  also  concluded  that,  because  the  specific  relations  be- 
tween these  elements  and  the  probability  of  the  financial 
distress  could  not  be  known,  the  proposed  general  relation 
should  be  expanded  to  include  all  possible  factors  affecting 
the  above-mentioned  elements,  ::nd  then  the  expanded  function 
should  be  exposed  to  a  statistical  test,  using  the  MDA 
technique . 

Having  discussed  the  role  of  the  MDA  technique  in 
differentiating  between  different  groups  in  Chapter  III,  it 
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is  now  tiroe  to  empirically  test  the  suggested  hypothesis. 
But,  to  fully  test  the  hypothesis,  there  is  a  need  to  have 
data  on  all  the  related  factors  included  in  the  expanded 
form  of  equation  (1)  in  Chapter  II.   Unfortunately,  however, 
as  reported  in  Chapter  I,  about  half  of  the  required  data, 
specially  with  respect  to  the  revenue  and  expense  items, 
could  not  be  obtained  due  to  their  confidentiality.   As  a 
consequence,  the  cash-flov;  element  in  the  hypothesized  func- 
tion is  now  substituted  with  a  more  general  term,  the  pro- 
fitability.  And  the  test,  therefore,  is  conducted  on  a 
limited  basis  using  only  the  information  v/hich  is  available. 
As  will  be  shown  later  m  this  chapter,  even  with  this 
limited  information,  the  results  are  encouraging. 

The  chapter  starts,  after  the  introductory  section, 
with  the  description  of  the  source  of  data.   Then,  the  method 
used  in  selecting  the  sample  is  explained.   And,  after 
specifying  the  data  set,  the  subjects  related  to  Test  I  are 
presented.   This  test  is  an  experiment  directed  to  test  the 
modified  hypothesis  with  the  information  contained  in  the 
last  financial  reports  of  the  failed  firms,  which  were 
issued,  on  the  average,  one  year  prior  to  th.eir  bankruptcy 
dates.   Later,  at  the  end  of  the  chapter,  a  search  will  be 
made  to  determine  the  possibility  of  the  extension  of  the 
methodology  employed  in  Test  I  to  a  more  remote  time,  i.e., 
two  years  prior  to  bankruptcy.   This  experiment  forms  Test 
II.   The  results  of  each  test  are  presented  at  the  completion 
of  their  related  analyses. 
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Source  of  Data 

As  v.'as  briefly  mentioned  in  Chapter  I,  the  main 
source  of  financial  data  for  the  study  was  the  Financial 
Questionnaire,  Form  X-17A-5.   A  small  number  of  firms, 
hov/cver,  had  their  Balance  Sheets  on  file,  instead,  or  had 
reported  their  financial  position  on  Form  X-17A-11.   Thus, 
wherever  Form  X-17A-5  was  not  available,  an  attempt  was  made 
to  obtain  similar  item.s  of  information  from  these  other 
sources . 

In  using  these  source?,  however,  running  into  some 
problems  was  inevitable.   A  part  of  the  missing  data  problem 
emanated   from   the   use   of   different  sources  of  data  to 
obtain  the  same  financial  item.   This  point  will  be  dis- 
cussed later.   The  second  problem  is  related  to  the  wide 
variation  existing  between  the  reporting  dates  of  the  same 
firm.   The  time  period  elapsed  between  the  last  tv/o  reports 
ranged  from  5  to  21  months  (and  in  one  case  23  months).  This 
situation  was  particularly  serious  for  the  failed  firms. 
For  these  firms,  the  mean  value  of  the  period  was  12  months, 
but  only  42  percent  of  the  firms  had  exactly  12  months' 
elapsed  time.   The   nonf ailed  firm.s  showed  less  variation. 
The  mean  value  of  the  period  in  this  group  was  also  12 
months,  but  the  percentage  of  firms  having  exactly  12 
months'  elapsed  time  period  were  73.   In  one  experiment,  an 
attempt  was  made  to  "normalize"  these  time  variations.   The 
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method  used  and  the  result  obtained  will  be  discussed  later 
in  the    section  on  "Potential  Biases." 

Another  point  to  be  noted  here  relates  to  the 
selection  of  appropriate  years  for  data  from  the   nonf ailed 
firms.   Although  it  might  have  been  desirable  to  have  the 
same  number  of  financial  reports  from  the  nonfailed  group 
in  each  year  as  from  the  failed  group,  the  exact  matching 
was  not  easily  possible  without  introducing  some  biases. 
The  bulk  of  the  last  reports,   nevertheless,  is  in  the 
years  1971  and  1972  in  both  groups  (65  percent  in  the  failed 
group;  70  percent  in  the  nonfailed  group).   Furtherm.ore ,  the 
inclusion  of  GNP  or  similar  variables  (among  the  variable 
set) ,  which  takes  into  account  the  extent  of  the  business 
activities  in  each  year,  may  remove  the  remaining  effects  of 
such  differences. 

In  addition  to  the  above  forms,  some  other  sources 
of  data  were  also  utilised.   Schedule  D  of  Form  BD  (SEC 
form)  was  used  to  obtain  som.e  information  about  the  presi- 
dent of  the  firm.   Am.ong  other  things,  this  form  has  sec- 
tions on  the  age,  education,  and  business  experience,  in 
the  past  ten  years,  of  the  person  involved.   Data  on  GNP, 
Dow- Jones  and  Standard  and  Poor's  stock  price  indexes, 
m.arket  interest  rates,  and  market  value  of  the  sales  of 
stocks  and  lands  were  obtained  from  Statistical  Abstract 
of  the  United  States.   Finally,  revenue  sharing  documents 
and  County  and  City  Data  Books  wore  used  for  obtaining 
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information  on  the  population  and  per  capita  income  of  the 
places  where:  the  firms  had  their  main  offices. 

An  im.portant  point  to  be  noted  in  connection  with 
the  financial  reports  is  the  wide  variation  which  exists 
among  the  failed  firm.s  with  respect  to  the  last  report 
lag  time  (LRLT)  v:hich  is  the  time  elapsed  betv/een  the  last 
report  and  the  filing  date  of  the  SIPC  app] ication  in 
court  for  liquidating  the  subject  firms.   As  shown  in 
Appendices  A  and  C,  the  LRLT  ranges  from  2  months  to  as 
long  as  32  months.   Its  mean,  liowever ,  is  about  11.5  mionths, 

Selection  of  Samples 

Fai].ed  Firm^s 

In  selecting  broker/dealer  firms  to  be  included  in 
the  sample,  attention  was  first  paid  to  the  failed  group. 
This  group's  population  consisted  of  109  firms  which  had 

been  placed  in  liquidation  by  SIPC  during  the  four-year 

2 

period  ending  December  31,  1974.    One  of  the  characteris- 
tics of  the  firms  v;hich  had  a  bearing  on  their  selection 
was  their  number  of  years  in  business.   The  distribution  of 
this  factor  for  the  failed-firm  population  is  shown  in 
Table  2. 

For  the  purpose  of  this  study,  wh. Ich  incorporates, 
am.ong  other  things,  the  standard  deviation  of  some  of  the 
data  items,  it  was  desirable  to  have  as  many  financial 
reports  (annual  in  this  case)  on   each  firm  as  possible,  and 
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TABLE  2.   THE  FAILED-FIRM  POPULATION  BY  YEARS  IN  BUSINESS 
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never  1l:3~,    ihan    two.   Rut,  of  109  failed  firms,  79  firras 
v/ere  more  than  two  years  old  and  only  53  firms  v/ere  more 
than  three  years  in  business.   In  addition,  reviewing  the 
files  of  the  firm.s  revealed  that  some  of  the  firms  either 
had  not  filed  their  last  reports,  or  had  some  missing  reports 
among  their  last  three  ones.   Furtheri:iore ,  during  the  data- 
collecting  period  some  of  the  files  were  not  available. 

Under  these  circumstances,  it  v/as  decided  that  the 
last  tv.'o  reports  v/ould  be  the  minimum  num.ber  of  reports  to 
be  collected  on  failed  firms  and  the  last  three  the  maxim.um 
appropriate  ones.   Thus,  the  following  criteria  were  set  to 
select  the  failed  firm  sample: 

1.  Availability  of  files. 

2.  Availability  of  two  years  or  more  of  financial 
data. 

3.  No  m.issing  reports  between  last  tv;o  or  three 
reports . 3 

By  using  these  criteria,  60  firras  were  selected 

and  these  firms  actually  exhausted  the  entire  failed  group 

4 
population.    In  the  next  step,  the  60  firms  were  divided 

randomly  into  two  equal-size  groups.   One  group  was  used  in 
the  original  sample  by  which  the  discriminant  function  .is 
calculated  (for  the  names  and  some  chai.^acter istics  of  the 
firms  in  this  group,  see  Appendix  A).   The  other  group  was 
used  in  the  holdout  sample  by  which  the  reliability  of  the 
constructed  model  is  tested  (for  names  and  some  characteris- 
tics of  the  firms  in  this  group,  see  Appendix  C) . 
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Nonfailod  Firms 

Selection  of  the  nonfailed  firnss  v;as  much  simpler 
than  that  of  the  failed  c:\t,s.   These  firms  v;cre  randomly 
selected  from  a  SEC  directory  which,  contained  the  names  of 
all  the  broker/dealer  firm.s.   There  were,  however,  some 
limitations  which  required  the  adoption  of  the  following 
criteria : 

1.  Existence  of  the  selected  firm,  i.e.,  it  has 
not  been  previously  liquidated. 

2.  Availability  of  files. 

3.  Availability  of  financial  data  for  at  least 
three  consecutive  years. 

By  using  the  above  criteria,  60  firms  were  selected 

to  comprise  the  nonfailed  group.   These  firms  were,  then, 

divided  randomly  into  two  equal-size  groups.   One  group  v/as 

used  in  the  original  sample  (the  names  of  the  firms  in  this 

group  are  listed  in  Appendix  B) ;  and  the  other  was  used  in 

the  holdout  sample  (the  names  of  the  firms  in  this  group 

are  listed  in  Appendix  D) . 

Data  Set 
After  sample  selection  and  collection  of  the  re- 
laced  available  Forms  and  other  sources  of  information  were 
com:pleted,  a  schedule  was  prepared  to  obtain  a  certain  set 
of  data  items  from  the  collected  m.aterial.   This  schedule 
incorporated  several  kinds  of  information,  including:   in- 
formation on  certain  characteristics  of  the  firm's  president; 
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inl'vOrm.T  t  ion  about  the  financial  report  under  consideration; 
data  on  certain  financial  items;  inforr.'.ation  on  non- 
financial  charact:eristics  of  the  firm;  information  on  some 
aspects  of  the  city  of  the  firm's  m.ain  business;  and  data 
on  some  indexes  of  the  business  activity  during  the  report- 
ing period.   The  exact  item.s  of  the  date  set  are  listed  in 
Appendix  E. 

Although  an  effort  v.'as  made  to  incorporate  only 
those  data  items  v;hich  wore  common  to  most  of  the  firms, 
occurrence  of  m.issing  data  in  many  places  was  inevitable. 
There  v;ere  several  causes  for  this  problem.   The  first  cause 
was  the  inavailabi lity  of  the  related  source.   The  lack  of 
Schedule  D  of  Form,  BD ,  for  example,  was  the  reason  that  in- 
form.ation  on  the  president's  characteristicn  could  not  be 
obtained  for  some  firms.   Replacement  of  Balance  Sheet  or 
Form  X-17A-11  for  Form  X-17A-5  was  the  second  one.   Type  of 
the  firm  was  the  third  one. 

In  this  regard,  if  the  firm  was  a  corporation,  its 
retained  earnings  could  be  easily  singled  out,  whereas 
separation  of  triis  item  for  some  of  the  partnership  and  most 

of  the  sole  proprietorship  type  of  organizations  was  not 

5 
possible  without  making  some  assumptions.    The  total  num- 
ber of  corporations,  partnerships,  and  sole  proprietorships 
in  the  failed  group  were,  respectively,  54,  3,  and  3;  the 
numbers  of  such  organizations  in  the  nonf ailed  group  were, 
respectively,  45,  8,  and  7. 
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The  last  cause  cf  the  missi'~g  data  problem  was  the 
method  used  by  firms  .for  reporti:;g  tt-.eir  financial  items. 
While  some  firms  had  reported  their  data  in  detail,  others 
had  reported  only  totals.   As  an    example,  most  firms  had 
reported  both  "failed  to  deliver  within  30  days"  and  "failed 
to  deliver  over  30  days."   There  \'jere    several  firms,  however, 
v;hich  had  reported  only  the  "total  failed  to  deliver," 

The  m.Lsslng-data  problem  is  not,  of  course,  some- 
thing specific  to  the  present  study.   In  fact,  in  most 
cases,  wherever  an  empirical  research  of  this  nature  exists, 
the  missing-data  problem  goe^s  with  it  hand-in-hand.   This 
situation  has  attracted  the  attention  of  many  writers,   and 
several  methods  have  been  suggested  for  solving  the  oroblem. 
The  easiest  methods  are  either  eliminating  the  observations 
which  have  the  missing  data,  or  omitting  the  variable  in 
which  missing  dc"ita  are  found.   But,  neither  of  these  methods 
fits  all  situations.   If  the  number  of  observations  is 
limited,  it  is  both  uneconomical  and  im.practical  to  discard 
the  whole  set  of  data  for  the  one  or  two  missing  items.   The 
requirement  of  having  a  sufficient  number  of  degrees  of 
freedom  also  precludes  such  a  treatment.    The  omission  of 
variables  may  also  be  at  times  infeasible.   This  may  be  due 
to  the  theoretical  im.portance  attached  to  the  subject 
variable. 

Some  other  methods  which  have  been  suggested  are 
Buck's  regresssion  m.ethod ,  Dear's  principal  component  method. 
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and  lacan  jsnbstitut ior\  method.   Th.G  pro5..T.t  study  has  used 

mainly  the  mean  suijstitution  method  for  the  treatment  of 

7 
missing  values.    In  doing  so,  each  variable  in  each  group 

(failed  or  nonfailed)  and  in  each  sample  (original  or  hold- 
out) was  considered  separately,  and  the  moan  value  of  the 
variable,  based  on  all  available  values  in  that  group  of 
that  sample,  was  used  for  the  missing  values  in  that  group 
of  that  sample  only.   The  choice  of  the  method  has  been  due 
to  the  limitation  in  the  number  of  observations,  the  analyti- 
cal value  of  the  variables,  and  the  result  of  an  experiment 
conducted  by  Chan  and  Dunn   which  shov/s  the  superiority  of 
the  mean  substitution  method  (and  principal  component 
method)  to  other  methods. 

Test  I  -  One  Year  in  Advance  of  Bankrupt c y 

Introduction 

In  the  section  on  "Source  of  Data, "  it  was  stated  that 
the  last  report  lag  time  (LRLT)  of  the  failed  firms  ranges 
from  2  months  to  32  months,  its  mean  value  being  11.5  months. 
With  this  in  mind,  if  the  attempts  in  constructing  a  dis- 
crim-inating  model  which  utilizes  the  information  contair;ed 
in  the  last  reports  prove  to  be  successful  in  correctly 
differentiating  most  of  the  failed  firms  from  nonfailed  firms , 
the  model's  predictive  span  may  be  construed  as  being  about 
one  year.   Building  up  such  a  model  constitutes  the  first 
experiment.  Test  I. 
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Variable  ScJ^ 

The  first  step  in  constructing  the  exoeriinent  is 
the  identification  of  the  potential  factors  which  might 
differentiate  between  financially  distressed  securities 
brokerage/dealer  firms  and  those  which  appear  to  be  eco- 
nomically viable. 

If  the  collected  data  could  have  been  exactly  matched 
with  the  analytical  fram.owork  set  forth  in  Chapter  II,  there 
would  have  been  no  problem  and  no  further  discussion  would 
have  seemed  necessary  in  this  regard.   Unfortunately, 
however,  the  limitation  encountered  on  the  data  side  has 
required  somc:  substitutions  and  reconsiderations  of  the 
relevant  factors.   However,  for  the  most  part,  the  same  line 
of  reasonings,  as  explained  in  Chapter  II,  has  been  followed 
in  determining  the  variables  to  be  included  in  the  model. 
A  list  of  the  variables  selected  in  this  way  is 
included  as  Appendix  F.   Although  the  list  has  not  been 
ordered  in  this  way,  each  variable  has  its  place  in  one  of 
the  general  classifications  of  profitability,  risk,  and  mis- 
management or  in  one  of  the  specific  classifications  there- 
under, such  as  internal  or  external  factors,  as  explained  in 
Chapter  II.   In  particular,  variables  related  to  the 
attributes  of  a  firm's  president,  accounts  failed  to  deliver 
and  failed  to  receive,  and  the  relative  significance  of  the 
property  and  fixture  accounts  wore  specifically  designed  to 
proxy  the  mismanagement  element  of  the  hypothesized  function 
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Variables  relating  to  the  version  of  Tchebychcf f ' s  in- 
equality and  gonor^lly  most  of  the  variables  in  the  form 
of  coefficient  of  variation  have  been  introduced  to  take 
the  firm's  risk  exposure  into  account.   Those  variables 
which  are  traditionally  considered  as  leverage  factors 
also  fall  uiider  risk  category.   And,  the  remaining  variables, 
including  those  related  to  the  general  business  activities 
and  the  characteristics  of  the  city  where  a  firm's  business 
is,  as  v;ell  as  the  traditional  variables  affecting  the 
profitability  of  a  firm,  fall  in  the  profitability  category. 

At  the  time  of  reporting  the  final  variable  sub- 
set selected  by  the  MDA  model,  a  brief  discussion  will  be 
made  of  some  of  those  variables  which  showed  up  as  relatively 
more  important  than  the  others.   At  this  point,  however, 
some  explanation  about  variables  X-50,  X-51,  and  X-73  through 
X-8  2  seems  in  order. 

When  the  concept  of  Tchebychef f ' s  inequality  (see 
equation  30  in  Chapter  II)  was  presented  in  the  context 
of  a  m,easure  of  risk,  it  was  concluded  that  this  type  of 
equation  can  be  used  to  estimate  the  probability  that  the 
value  of  a  random  variable,  such  as  retained  earnings,  falls 
at  or  below  a  disaster  level.   This  idea  was  utilized  in 
creating  variables  X-50,  X-51,  and  X-73  through  X-82,  but 
with  some  modification  wliich  will  be  explained  below. 

Rather  than  using  Tchebychef f ' s  inequality  in 
the  original  form,  the  reciprocal  of  its  square  root  was 
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utilizoa.   '''ore  exactly,  the  variables  were  created  by 

2       ■^ 
using  (E-d)/.:  rather  than  using  ■■    /(hl-d)".   There  were  two 

reasons  for  such  a  treatment:   (I)  As  is  seen.,  in  the 
original  version  the  expression  (E-d)  is  in  the  denominator. 
It  happens  soraetiraes  that  E  becomes  equal  to  d,  resulting 
in  an  undefined  ratio.   This  problem  is  removed  by  invert- 
ing the  fraction;  (2)  In  the  original  version  the  expres- 
sion (E-d)  has  been  squared.   This  kil].s  the  negative  sign 
of  the  result  for  the  cases  where  E<d.   The  distinction 
between  cases  in  which  the  mean  value  of  the  variable  devi- 
ates from  above  or  fromv  below  the  disaster  level  is,  however, 
of  m.uch  value  to  the  purpose  of  this  study.   Avoidance  of 
the  squaring,  of  course,  solves  this  problem.   The  result- 
ing version  (E-d) /a,  may  also  be  thought  of  as  the 
standardized  devi  ati  on  of  mean  from,  the  disaster  level . 

Another  point  of  possible  interest,  with  respect  to 
the  variable  set,  is  the  existence  of  high  correlations 
betv.'een  some  of  the  variables,  in  all  tests  (see,  for 
exam.ple,  the  total  correlation  m.atrix  of  Test  I  in  Appendix 
G)  . 

The  extent  of  the  relations  ranges  from  perfect 
positive  to  perfect  negative  correlation.   In  some  cases, 
the  perfect  positive  correlations  exist  between  those 
variables  which  have  been  based  upon  one  significant  common 
element  of  the  data  item.   For  example,  X-7,  X-8,  and  X-9 
which  have  cash  item  in  common,  are  perfectly  positively 
correlated  with  each  other.   In  other  cases,  the  perfect 
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positive  correlations  .ire  found  anong  th-,-  variables  such 
as  X-49,  X-55,  X-56,  X-59,  and  X-84,  v;hich  are  indicators 
of  the  level  of  the  general  business  activities.   Finally, 
two  exanplos  of  perfect  negative  correlation  are  the  corre- 
lations betv.'een  X-5  and  X-50,  and  X-6  and  X-51  which  equal 
to  -1,  in  both  cases. 

The  existence  of  such  high  correlations  betv;een 
some  variables  necessitates  oiidssion  of  one  of  the  variables 
from  each  correlated  pair.   This  treatment  is  necessary 
because  it  eliminates  computational  and  operational  diffi- 
culties.  If  such  variables  are  not  removed  from  the  analy- 
sis, negative  square  roots  result  during  the  process  of 
inverting  the  dispersion  matrices.   Even  if  a  little  less 
than  perfect  correlation  makes  the  convergence  on  the  pro- 
per inverse  matrix  possible,  the  resulting  relations  may 
not  be  stable.   In  the  present  study,  in  all  tests,  one  of 
the  variables  in  the  pairs  having  correlation  coefficient 
equal  to  or  greater  than  .87  have  been  omitted.   The  choice 
has  been  based  principally  on  considerations,  such  as 
fewer  missing  data  on  and  easier  interpretation  of  the 
retained  variables.   The  variables  omitted  due  to  the  cor- 
relation have  been  m.arked  by  asterisks  in  the  variable 
list  in  Appendix  F. 

A  final  point  with  respect  to  the  variable  set 
is  the  variable  structure  of  total  observations.   A  table 
including  the  mean,  standard  deviation,  standard  error  of 
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mean,  maxir.um,  and  r>iiniri'.i:r.  of  each  variable  in  each  fai  led  group 
(60  firrr.s)  an^I  nonf ailed  group  (60  firns)  is  included, 
separai-ely,  in  Appendi.::  H.   The  univariate  F-tosts  of  the 
significance  of  mean  se;)ara*,ion  between  the  two  groups 
have  also  been  included  in  this  appendix. 

Discriminant  Analysis 

In  providing  the  groundwork  for  the  present  analy- 
sis, the  theoretical  aspects  and  general  procedure  for  the 
utilization  of  the  MDA  technique  were  discussed  in  Chapter 
III,  rather  in  detail.   Here,  the  concern  revolves  around 
the  specific  procedure  and  relevant  points  in  applying  the 
method  to  the  case  of  the  brokerage/dealer  firms. 

Methodology.   After  all  the  relevant  variables  were 
determined,  the  original  sample  was  used  for  deriving  the 
discriminating  function.   Since  the  number  of  variables  was 
84  and  the  number  of  observations  in  each  of  the  f  a.i  led  and 
nonf ailed  groups  in  the  original  ssimple  was  30,  a  one-step 
method  was  not  possible  in  computing  the  function.   The 
maximum  number  of  the  variables  which  could  be  included  in 
the  analysis,  while  being  able  to  conduct  all  the  related 
statistical  tests,  was  28  (the  number  of  observations  in 
each  group,  30,  minus  the  num.ber  of  groups,  2).   Therefore, 
the  following  procedure  v/as  used  in  finding  the  "optimal" 
subset  of  variables. 

Run  1.   By  utilizing  the  Eisenbeis-Avery  MDA 

9 

computer  program,   the  lirst  bO  variables  (excluding  X-1  , 
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the  firm's  idc;;  ti  f  ication  number)  v;cvo    first,  used  for 
determining  tht?  correlated  variables. 

Run  II.   Then,  the  last  50  variables  were  run  for 
the  sarne  purpoL-e  as  above.   The  overlap  was  intended  to  ob- 
tain more  information  on  the  correlations.   The  results  of 
these  two  runs  led  to  the  omission  of  22  variables  from 
further  consideration,  leaving  61  variables  for  the  analy- 
sis (see  Appendix  F) . 

As  was  discussed  in  Chapter  III,  among  the  various 
procedures  v/hich  can  be  used  for  selecting  the  last  subset 
of  variables,  the  complete  stepv/ise  method  is  the  only  one 
which  can  guarantee  that  the  selected  variables  are  the  best 
ones.   Yet,  its  use  is  limited  to  the  cases  where  there  are 
only  a  few  variables.   As  the  nuniber  of  variables  increases 
the  required  computing  time  becomes  a  forbidding  problem. 

An  alternative  method  wnicii  seems  reasonably  accu- 
rate and  feasible  is  the  midway  m.ethod  betvreen  backv/ard  or 
forward  stepwise  procedures  and  the  com.plete  stepwise  method. 
It  is  not  as  good  as  the  latter,  but  it  im.proves  the  result 
of  the  former.   This  alternative  v;hich  has  been  used  in  the 
present  v;ork  combines  the  forward  and  backward  stepwise 
procedures  in  the  last  part  of  the  analysis. 

Run  III.   By  using  the  forward  stepwise  procedure, 
the  first  28  variables  (disregarding  the  omitted  variables) 
v/ere  run  and  the  "best"  14  variables  from  this  subset  were 
selected . 
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Run  IV.   The  same  procedure  as  above  v.'as  used  for 
obtaining  the  second  14-variable  subset.   This  left  5 
variables  unexamined  at  this  point. 

Run  V.  The  28  selected  variables  obtained  from 
Runs  III  and  IV  were  run  by  a  forward  stepwise  procedure 
and  the  "best"  14  variables  were  selected. 

Run  VI.   The  same  28  variables  were  then  run  by  a 
backward  stepwise  procedure  and  again  the  "best"  14  vari- 
ables v.'ere  selected. 

The  results  of  Run  V  and  Run  VI  are  tabulated  in 
Table  III.   Comparison  of  these  results  is  useful  in  further 
screening  of  variables.   As  is  seen  in  the  table,  IC  vari- 
ables have  been  selected  for  both  forward  and  backward 
procedures,  8  variables  by  one  or  the  other  procedure,  and 
the  remaining  10  by  neither  of  them.   By  excluding  these 
last  10  variables  from  further  analysis  another  step  is 
taken  toward  the  final  selection.   The  logic  behind  this, 
treatment  is  that  if  a  variable  does  not  appear  in  the 
selected  14  variables  by  either  the  forward  selection 
method  or  the  backward  selection  procedure,  it  signifies 
that  the  variable  most  likely  lacks  any  discriminant  pov/er . 

Run  VII.   This  last  run  contained  the  10  variables 
common  to  both  procedures,,  the  8  \'ariables  selected  by  only 
one  or  the  other  procedure,  and  th.e  5  unexamined  variables 
left  from  Run  II.   These  23  variables  were  run  by  using  the 
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forvard  selection  method,  and  a  subset,  of  12  variables  were 
finally  selected. 

The  choice  of  12  has  been  due  to  cor'ipromise  between 
accuracy  and  complexity  of  the  raodel.   The  criterion  for 
accuracy  v;as  Rao's  F  test,  and  the  raeasure  for  complexity 
was  inclusion  of  additional  variables.   The  percentage 
significance  ]evel  of  Rao's  F  test    serves  as  a  relative 
indicator  of  the  reduction  in  discriminatory  power  of  the 
entire  variables  set  by  using  each  smaller  subsets.   This 
significance  level  for  t'le  12  variable  subset  was  99.6  per- 
cent, whereas   the  related  values  for  11-variable  subset  and 
13-variable  subset  were,  respectively,  93.7  percent  and 
99.8  percent.   Selection  of  12  variables  seems,  therefore, 
to  be  a  reasonable  trade-off  between  accuracy  and  complexity 
when  the  gain  in  precision  from  11  to  12  variables  (5.9 
percent)  is  compared  with  the  loss  in  precision  from  13 
to  12  variables  (0.2  percent). 

The  selected  variables.   Thus,  as  a  result  of  the 
above  procedure,  12  variables  were  selected  to  construct 
the  discriminating  model.   These  variables,  along  v;ith  some 
additional  information  about  their  properties,  have  been 
included  in  Table  4.   But,  before  the  results  of  such 
selection  are  analyzed,  som.c  explanation  of  the  reasons 
for  including  these  variables  seems  necessai-y. 

As  mentioned  earlier,  each  of  the  t-ariables 
included  in  tiie  analysis  has  its  place  in  one  of  the  three 
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categories  corrospcnd  ing  to  the  three  eleinents  in  the 
hypothesized  function.   Although  3on:C  of  the  variables  may 
appear  to  belong  to  more  thian  one  group,  the  intention  has 
been  to  include  them  in  a  category  wh.ere  their  differentiat- 
ing povvor  seems  to  operate  more  directly  and  forcefully. 

Profitability.   Variable  X-12,  which  is  the  ratio 
of  retained  e.-irnings  to  total  assets  clearly  falls  in  this 
category.   The  higher  the  profit  is,  the  more  likely  that 
the  ratio  becom.es  larger.   Percentage  change  in  equity 
capital,  X-45,  also  signals  the  extent  of  profitability. 
If  the  perspective  of  the  firm's  business  is  promising  the 
capital  is  usually  increased.   In  retrospect,  also,  a 
fruitful  ye^^r  increases  the  proportion  of  the  equity  capi- 
tal with  respect  to  the  previous  one.   The  reverse  is,  of 
course,  true  for  a  g].oomy  perspective  and  a  year  full  of 
losses.   Furthermore,  the  sign  of  the  ratio  indicates  the 
direction  of  the  change.   Positive  profit  is  most  likely 
accompanied  by  a  positive  percentage  cliange  in  equity 
capital  and  negative  profit  by  a  negative  change. 

Other  variables  which  belong  to  this  category 
are  X-5  and  X-58.   The  relation  of  a  firm's  age  (X-5)  to 
its  profitability  and  the  role  that  the  interest  rate  (X-5S) 
plays  in  a  firm.'s  revenue  and  expense  accounts  have,  how- 
ever, been  discussed  in  detail  in  Chapter  II. 

VJhile  most  of  the  above  variables  are  internal 
to  the  firip. ,  that  is,  the  firm  has  control  of  thoni,  interest 
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rate  variable  is  exLernal  to  the  firr.!.   The  appearance  of 
this  external  factor  airiong  Lho  factors  differentiating 
failed  firms  fro::!  nonfaj  led  f irns  raay  be  a  further  induce- 
nent  for  reconsideration  of  the  traditional  approach  in 
solving  a  similar  problem  which  utilizes  the  internal  fac- 
tors only. 

Risk.   Some  of  the  variables  which  fall  in  this 
category  are  versions  of  the  traditional  financial  ratios 
dealing  with  leverage.   But,  others  are  not  that  familiar. 
X-15  and  X-lb  need  little  explanation.   It  is,  however, 
worthwhile  to  note  the  role  v;hich  subordinated  debt  plays 
in  these  ratios.   Subordinated  debt  is  generally  considered 
as  a  part  of  tne  firm's  capital  for  indemnification  of  other 
creditors.   Yet,  it  is  viewed  as  a  devastating  liability  by 
some  of  the  equity  owners.   As  reported  in  Chapter  I,  this 
very  subordinated  debt  has  put  several  firms  in  the  red, 
due  to  untimely  withdrawal  or  improper  execution  of  its 
related  agreement. 

X-22  is  the  ratio  of  the  sum  of  the  partly  secured, 
secured,  and  unsecured  customers''  securities  accounts  to 
the  firm's  total  assets.   Holding  other  things  constant, 
a  firm  which  conducts  most  of  its  securities  business  on 
the  cash  basis  or  deals  m.ostly  ^vith  fully  secured  accounts, 
in  the  case  of  margin  transactions,  bears  less  risk  than 
another  firm  having  relatively  large  volum.es  of  partly 
secured  or  unsecured  eiccounts. 
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X-52  and    X-G9  relate  to  the  trodinq  and  investment 
account.s.   Firiris  v.iiich  :;ely  more  on  ccr.'jnissions  and  other 
sources  of  revenue  Vv-hich  arc  nc^ro  stable  are  less  subject 
to  risk  than  those  v/hich  concentrate  more  on  trading  of  and 
investing  in  securities.   The  existence  of  volatility  in 
some  of  these  accounts,  as  measured  by  their  coefficient 
of  variation,  probably  adds  to  this  risk.   This  may  be  due 
to  such  factors  as  the  high  riskiness,  or  gambling  nature, 
of  such  accounts.   furthermore,  the  higher  is  the  percentage 
of  such  accounts  on  short  position,  the  higher  becom.es  the 
risk  that  the  firm  is  exposed  to. 

X-78  measures  the  risk  associated  with  the  devia- 
tion of  the  ratio  of  the  partly-secured  and  unsecured  ac- 
counts to  total  assets  (X-22)  from  the  related  disaster 
level  (being  the  moan  of  the  same  factor  in  the  failed  group 
firms).   The  higher  is  the  deviation  from  the  above,  the 
higher  becomes  the  risk  exposure  of  the  firm,  and  vice 
versa. 

Mismanagement.   The  inclusion  of  X-6,  the  age  of 
the  firm's  president,  in  trie  analysis,  was  based  on  the 
assumption  that,  holding  other  things  constant,  the  ability 
of  the  president  in  managing  the  firm's  affairs  is  directly 
related  to  his  overall  experience  which  in  turn  is  a  func- 
tion of  ago.   In  reviewing  the  data  on  the  failed  and  non- 
failed  groups,  it  was  noticed  that  there  were  somtC  differen- 
ces between  these  groups  in  this  regard.   This  point  was 
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a]  so  noted  by  Lhc;  subconmi  tto'"  on  Commerce  and  Finance  v/hich 

12 
recommends  the  following: 

...The  subcomraittoo  believes,  however, 
that,  in  addition,  uniform  minimum  age 
and  experience  qualifications  should  be 
established  by  the  SEC  and  therefci"e 
recommends  that  sections  15  and  15 A  of 
the  Securities  Exchange  Act  be  am.ended 
to  direct  the  SEC  to  establish  such  mini- 
mum s  t  a  n  d  a  r  d  s  .  .  .  , 

X-53,  the  ratio  of  the  property  and  fixtures  to 
total  assets  may  also  measure  the  ability  of  the  mianagement 
to  put  the  available  resources  to  their  best  use.   If  a 
relatively  large  part  of  such  scarce  resources  is  tied  up 
in  an  asset  which  does  not  take  its  part  in  generating 
revenue,  the  ability  of  the  m.anagcment  is  q'cestionablc . 
Variable  X-31  measuers  the  standardized  deviation  of  the 
mean  of  X-53  from  its  related  disaster  level  (calculated  by 
averaging  the  same  in  the  failed-f irm  group) . 

H^  test  of  equality  of  dispersion  natrices.   Be- 
fore the  selected  variables  can  be  used  for  constructing  a 
discriminant  function,  i.t  is  necessary  to  have  knowledge 
about  the  equality  of  the  dispersion  miatrice-s  of  the  sampled 
group.   As  discussed  in  Chapter  III,  if  these  matrices  are 
equal,  the  proper  discrim.inant  function  is  a  linear  one. 
Otherwise,  a  quadratic  function  should  be  used. 

By  using  the  Box's  approach  (see  equations  (1) 
through  (10)  of  Chapter  III),  the  computer  program,  performed 
the  H,  test  for  determining  such  equality.   With  the  first 
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dc-groG  of  frccdora    (greater  mean  squarr-)  equal  to  7  8  and  the 
second  degree  of  freodora  equal  to  10622.8,  the  related  F 
statistic  equaled  6.068  which  is  sianif leant  at  0.0  percent 
level.   Thus,  the  hypothesis  that  the  matrices  are  equal  is 
rejected,  beyond  any  doubt.   The  proper  discriminant  func- 
tior:  is,  therefore,  a  quadratic  one. 

The  discrim.inant  function  and  its  analysis.   In 
Chapter  III,  it  was  stated  that  the  MDA  technique  can  be 
used  in  tv;o  capacities:   (1)  in  analyzing  the  differences 
between  two  or  more  groups  and  making  inferences,  and  (2) 
in  assigning  individuals  to  the  group  to  which  they  have 
the  greatest  resemblance,  according  to  a  certain  set  of 
attributes,  and  miaking  predictions. 

For  inferential  use  of  the  technique  the  linear 
discriminant  function  which  is  constructed  by  the  12  se- 
lected variables  is  utilized,  despite  the  result  of  the  H^ 
test.   The  reason  for  such  a  choice  will  be  explained 
shortly.   For  predictive  use  of  the  technique,  as  well  as 
for  validity  tests,  the  quadratic  function  is  used. 

There  are  two  reasons  for  using  the  linear,  rather 
than  the  quadratic,  discriminant  function  for  the  inferen- 
tial purposes: 

(1)  The  quadratic  function  with  such  a  large  numbej 
of  variables  is  too  com.plicated  to  make  any  meaningful  anal; 
sis  possible. 
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(2)  As  will  bo  shown  later,  due  to  the  existence 
of  a  large  distance  between  the  means  of  the  two  groups  as 
well  as  the  virtually  nonexistent   of  overlap  between  the 
groups,  there  will  be  little  difference  between  the  results 
of  the  linear  versus  the  quadratic  procedure.   In  this 
regard,  the  trade-off  between  slightly  less  precision  and 
disproportionately  more  complexity  is  well  worthwhile. 

It  should  also  be  noted  that  the  inferential  use 
of  the  linear  function  does  not  prevent  the  use  of  a 
quadratic  function  in  classification;  nor  does  it  change 
any  result  thereunder. 

As  was  stated  in  Chapter  III,  there  are  several 
alternative  raethods  of  computing  the  linear  discriminant 
function.   One  way  is  to  use  the  matrix  of  eigenvectors 
associated  with  the  latent  roots  of  the  determinental 
equation  |Q-AW!  =  0,  where  Q  is  the  matrix  of  the  weighted 
among-qrouos  deviations  sum  of  squares  of  X,  and  W  is  the 
matrix  of  pooled  within-groups  deviations  sum  of  squares. 
This  is  the  method  used  by  the  computer  program  for  present- 
ing the  inferential  part  of  the  model.   For  computing  the 

14 
classification  equations  in  the  test  space,    however,  a 

more  direct  method,  similar  to  the  one  explained  in 

Chapter  III,  has  been  used.   But,  so  far  as  the  functions 

computed  in  different  ways  are  proportional  to  each  other, 

the  choice  makes  little  difference.   This  is,  of  course, 

due  to  the  fact  that  only  the  ratios  of  the  coefficients 

of  the  function  are  important. 
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Tabic  4  shov.'s  the  coefficient?",  of  the  linear 
discr irnin£.nt  function  against  the  related  variables.   The 
relative  importance  of  each  variable  is  also  shown  in  an  adja- 
cent column.   An  analysis  of  these  values  in  conjunction 
with  the  other  values  in  the  table  may  shed  some  light  on 
the  issues  for  which  the  present  research  was  undertaken. 

By  nov;,  it  should  have  been  established  r.hat  one 
of  the  most  significant  features  of  the  MDA  is  its  ability 
to  drastically  reduce  the  dimensions  of  the  problem  on  hand. 
In  the  present  case,  this  reduction  in  dimensions  has  been 
performed  through  multiplying  the  coefficient  of  each 
variable  by  the  value  of  that  variable  in  each  original 
observation,  sumndng  up  the  results  for  all  the  variables 
in  the  function,  and  arriving  at  a  discriminant  score  for 
each  observation.   The  centroid  or  the  m.ean  of  such  dis- 
criminant scores  for  the  failed  group  amounted  to  0.03546 
and  that  for  the  nonf ailed  group  to  0.14356. 

Thus,  the  lower  discriminant  score  is  associated 
with  the  failed  group  and  higher  scores  with  the  nonfailed 
group.   Therefore,  if  the  contribution  of  a  variable  in  the 
function  is  toward  the  increment  in  the  discriminant  score, 
the  higher  values  of  the  variable  are  associated  with  non- 
failure,  whereas  the  lower  values  are  related  to  failure. 
More  specifically: 

X-5  is  the  firm's  fige.   Its  coefficient  is  positive, 
Thus,  the  higher  the  age  is,  the  greater  becomes  the 
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discriminant  score,  and  the  higher  becomc-y  the  probability 
that  the  firm  falls  in  th-.?  nonfai  led  group.   The  reverse  is, 
of  course,  true.   The  lov.-er  the  age,  the  smaller  becomes 
the  discrim.inant  score,  and  the  higher  becomes  the  prob- 
ability that  the  firm  falls  in  the  failed  group.   In  making 
inferences,  the  logic  is  reversed:   Given  that  the  firm  is 
a  failed  one,  and  holding  other  things  constant,  its  low 
discriminant  score  is  associated  with  its  low  age.   This 
result  is  congruent  with  what  v;as  expected  and  consistent 
with  what  was  found  by  the  NASD  study.   The  variable,  how- 
ever, accounts  for  only  6.43  percent  of  the  total  discri- 
minatory pov/er  of  the  function  and  ranks  8  ai-iong  the  12 
variables. 

The  variable  associated  with  the  age  of  the  firm's 
president,  X-6,  also  shares  the  same  attributes  assigned  to 
that  of  the  firm's  age.   That  is,  the  failed  firms  show  that 
their  presidents  v;ere,  on  average,  younger  than  the  non- 
failed  frims.   This  is  confirmed  by  looking  in  the  columns 
of  the  table  v/hich  show  the  mean  values  of  the  variables 
in  each  group.   As  is  seen  there,  the  m.ean  for  the  failed 
group  is  39.21  years,  whereas  it  is  53.44  years  for  the 
nonfailed  group.   The  univariate  F  test  ratio  for  the 
differences  between  these  two  r.ieans  is  29.5.7  which  is 
significant  at  the  .001  level  and  ranks  first  among  the 
other  variables,  in  this  regard. 
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X-12,  thG  ratio  of  recained  earnings  to  total 
assets,  has  also  a  oosjtive  relation  to  the  discriminant 
score.   It  shows  that  the  nonf ailed  firms  enjoy  a  higher 
ratio,  v;heroa3  the  failed  firms  are  plagued  by  a  low  ratio. 
In  fact,  the  mean  of  this  ratio  in  the  failed  group  is  nega- 
tive (-9.16)  which  indicates  that  the  failed  firms  have,  en 
average,  negative  retained  earnings  or  a  deficit  which 
resulted  from  cumulative  losses.   On  the  other  hand,  the 
mean  of  the  ratio  in  the  nonf ailed  group  is  as  large  as  0.23, 
This  means  that  the  retained  earnings  of  the  sample  firms  in 
the  nonf ailed  group  constitute  about  2  3  percent  of  their 
total  assets.   This,  in  turn,  indicates  the  profit-making 
ability  of  the  nonf ailed  firms.   This  variable  is  highly 
significant  both  on  the  univariate  basis  and  in  combination 
with  other  variables.   Its  univariate  F  ratio  is  21.20  which 
is  significant  at  the  .001  level.   In  the  discriminant  func- 
tion it  ranks  second,  and  accounts  for  12.24  percent  of  the 
total  discriminatory  power  of  the  function.   The  im.portance 
of  this  variable  in  the  present  case  is  consistent  with  the 
finding  of  the  NASD  study  in  this  regard. 

The  relation  between  X-15  and  the  discriminant 
score  is  also  positive.   But,  despite  -its  second  place  on 
the  univariate  basis,  it  ranks  last  in  the  function  and 
contributes  only  about  2  percent  of  the  total  discriminatory 
power.   This  variable,  nevertheless,  shows  that  nonfailed 
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firms  possess  broader  bc-.ses  of  equity-  capital  and  subordi- 
nated debt  than  their  failed  counterparts. 

X-18  shares  sinnilar  attributes  ascribed  to  X-15 
with  respect  to  its  rar^k  and  contribution.   Its  coefficient 
is,  liov/ever,  negative,  indicating  that  hig'ner  values  of  the 
variables  are  associated  with  failed  firms  and  lower  values 
with  nonf ailed  firms.   In  fact,  the  related  mean  value  of 
the  failed  group  is  almost  twice  that  of  the  nonfailed 
group.   Verbalizing,  the  failed  firms  rely  much  more  on 
financial  leverage  (excluding  subordinated  debt)  than  the 
nonfailed   firms.   In  reference  to  the  exclusion  of  sub- 
ordinated debt,  it  may  be  worthwhile  to  note  its  "noise-" 
reducing  effect  in  the  ratio.   This  effect  is  well  under- 
stood if  it  is  noted  that  variable  X-19,  which  is  the  ratio 
of  total  liabilities  (being  total  assets  minus  owners' 
equity)  to  total  assets,  has  not  been  selected  by  the  pro- 
cedure. 

As  a  leverage  measure,  the  NASD  study  has  found  that 
the  rati.o  of  (Total  Liabilities  f  Subordinated  Loans) /Owners ' 
Equity  is  a  significant  variable.   Its  equivalent  variable 
in  the  present  study,  i.e.,  X-14,  however,  has  not  been 
selected  by  the  procedure  in  this  test.   Nevertheless,  the 
conclusion  of  the  NASD  study  with  respect  to  the  higher 
leverage  (nearly  twice)  of  the  failed  firms  is  consistent 
with  the  above  findings. 
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The  ratio  of  the  p-TrtJy  secured  aiid  unsecured 
accounts  to  total  assets,  X-22,  is  inversely  related  to  the 
discriminant  score,  as  indi.cated  by  the  negative  coeffi- 
cient.  Thus,  the  failed  firms  in  the  analysis  have 
demonstrated  that  they  have  had  proporiiionately  more  cus- 
tomer securities  accounts  which  have  been  pjirtly  secured 
or  not  secured  at  all  than  the  nonf ailed  firms.   This 
variable  is  fairly  significant  in  differentiating  between 
the  two  groups,  and  ranks  fifth  among  other  variables. 

The  coefficient  of  X-45,  the  percentage  change  in 
equity  capital,  is  also  negative.   The  negsitive  sign  indi- 
cates that  the  change  is  more  pronounced  in  the  failed  firms 
than  nonf ailed  firms.   In  fact,  the  mean  of  the  ratio  in 
the  failed  group  is  four  times  larger  than  that  in  the  non- 
failed  group.   The  variable  accounts  for  8.70  percent  of  the 
total  discriminatory  power  of  the  function  and  ranks  seventh 
among  other  variables.   But,  its  univariate  F  ratio  is  not 
significant  at  the  0,10  level. 

X-52,  which  is  the  percentage  of  trading  and  invest- 
ment accounts  with  credit  balances,  also  has  a  negative  co- 
efficient.  It  m^ay  be  inferred,  therefore,  that  the  failed 
firms  are  in  a  higher  "short"  position  than  their  nonfailed 
counterparts.   The  rank  of  this  variable  is  relatively  high 
(fourth)  on  both  univariate  and  multivariate  bases  and  it 
contributes  10.23  percent  of  the  total  discriminatory  power. 
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The  fffecL  of  X-58,  tho  interest  rate  in  the  year 
prior  to  the  reporting  date,  has  been  negative  on  the  failed 
firms  and  positive  on  the  nonf ailed  firms.   The  reason  for 
this  situation  may  be  the  net  position  of  these  firms  with 
respecc  to  their  interest  income  and  interest  charges.   The 
performance  of  this  vrtriable  is  consistent  with  the  behavior 
of  X-18.   This  is  the  only  external  variable  which  has 
shown  a  relatively  significant  discriminatory  pov;er  in  the 
present  test. 

X-69,  the  coefficient  of  variation  of  X-41,  is  the 
first  ranking  variable  which  contributes  15.78  percent  of 
the  discriminatory  power  of  the  function.   This  and  the  next 
tv;o  variables  are  of  the  type  suggested  in  Chapter  II  to  be 
included  in  the  analysis  for  revealing  the  differences 
between  firms  with  respect  to  the  volatility  of  their  rela- 
ted factors  and  risk.   The  coefficient  of  the  variable  here 
is  negative,  indicating  that  the  higher  volatility  in  trad- 
ing and  investment  accounts  is  associated  v;ith  failed  firms. 
The  nonfailed  firms  in  the  saraple  have  shown  m.ore  stability 
in  their  trading  and  investment  activities.   It  may  also 
be  of  interest  to  note  that  variable  X-41,  of  which  X-69  is 
the  coefficient  of  variation,  has  not  been  selected  by  the 
procedure  due  to  its  lack  of  discriminatory  power, 

X-78  is  a  version  of  Tchebychef f ' s  inequality  for 
the  ratio  of  partly  secured  and  unsecured  accounts  to  total 
assets  (X-22) .   As  previously  discussed,  this  ratio  can  be 
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thought  of  as  the    stiindardizcd  deviation  of  X-22  from  its 
related  Gis^^ster  level.  This  variable  has  been  shown  to  possess 
a  relatively  high  discriminatory  pov.'er  (12.19  percent),  and 
its  rank  is  third  among  other  variables.   Its  negative  co- 
efficient indicates  that  the  higher  values  of  the  variable 
are  associated  with  the  failed  firras  and  the  lov/er  values 
with  the  nonf ailed  firras.   In  fact,  the  mean  of  the  variable 
in  the  failed  group  is  -1.99  which  is  much  larger  than 
-44.89,  the  mean  of  the  variable  in  the  nonf ailed  group. 
This  difference  signifies  that  the  deviation  of  the  variable 
from  its  related  level  is  sma3 1  in  the  failed  firms  (as  it 
had  been  designed  to  be,  by  averaging  the  values  of  the 
variable  in  the  failed  group  to  obtain  the  disaster  level) , 
whereas  the  deviation  in  the  nonfailed  group  is  very  large 
and  from  below. 

It  may  be  of  interest  to  note  that  both  X-22  and 
X-7B  have  appeared  in  the  function  (both  with  negative 
signs) ,  but  the  discriminatory  power  of  X-78  is  greater  than 
that  of  X-22.   It  may  also  be  worthwhile  to  note  that  the 
variable's  significance  in  the  multivariate  com.bination  is 
in  direct  contrast  to  its  insignificance  on  the  univariate 
basis  (its  univariate  F  ratio  is  2.58  which  is  not  signifi- 
cant at  the  0.10  level). 

X-81  is  of  the  same  type  as  X-78.   It  can  be 
thought  of  as  a  measure  of  the  standardized  deviation  of 
X-53  from  its  related  disaster  level.   X-53,  which  has  not 
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appeared  in  the  fvanction  due  to  the  lack  of  any  discrimina- 
tory power,  is  the  ratio  of ■property  nnd  fixtures  to  total 
assets.   X-81,  by  its  negative  coefficient,  indicates  the 
association  of  its  larger  values  with  failure.   It  shows 
that  the  percentage  of  total  assets  of  a  nonfailed  firm  tied 
up  in  property  and  fixtures  is  significantly  below  the  level 
that  a  failed  firm.,  on  average  maintains.   This  may  signify 
the  ability  of  nonfailed  firms  and  inability  of  the  failed 
firms  in  appropriate  management  of  their  limited  available 
resources.   This  variable  also  is  not  significant  on  the 
univariate  basis,  but  ranks  tenth  in  the  multivariate  com- 
bination, and  contributes  4.51  percent  of  the  total  dis- 
criminatory power  of  the  function. 

The  differences  between  the  group  means.   After  the 
"best"  feasible  subset  of  variables  was  selected,  the  H^ 
test  described  in  Chapter  III  was  conducted  to  see  whether 
the  differences  between  the  means  of  these  variables  in  the 
two  groups  is  significant  or  not. 

From  among  sever^^l  methods  available  for  this  pur- 
pose, two  were  utilized.   The  first  one  was  the  Rao's  test 

which  follows  equation  (42)  in  Chapter  III.   The  second  one 

2 

was  Hotelling's  T   which  has  been  defined  m  equation  (12) 

of  the  same  chapter.   The  results  of  conducting  these  tests 
are  reported  below. 

Rao's  test.   Rao's  test  of  equality  of  group  means 
is  an  F  test  which  relies  upon  the  Wilks   lambda.   The 
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calculated  F  statistic  of  this  tost  with  12  and  47  degrees 

of  ireodoin  was  amounted  t-o  20.96101  which  is  significant 

at  tho  0.64669  E-12  percent  level.  And,  the  Wilks   lambda 
for  the  tost  v::i:^    0.1574367. 

9 

Ilotelling's  T"  test.   This  test  has  also  an  F  dis- 
tributiors  :,ac    is  suitable  for  the  two-grcup  case.   The 
related  F  statistic,  with  12  and  47  degrees  of  freedoni, 
equaled  20.9  7496  -which  is  significant  at  the  0.638  5504  E-2 
percent  level. 

The  overlap  between  groups.   In  the  previous  para- 
graphs, it  was  concluded  that  the  differences  between  group 
means  are  highly  significant.   Although  the  existence  of  a 
relatively  large  distance  between  the  group  means  is  neces- 
sary to  obtain  the  best  predictive  performance  of  the  tech- 
nique, it  docs  not  alv/ays  guarantee  that  the  designed 
classification  scheme  is  successful.   The  cause  is,  of 
course,  a  possible  overlap  between  groups.   To  evaluate 
the  extent  of  success,  it  is  necessary  to  examine  the  extent 
of  the  overlap. 

The  Fisenbeis  and  Avery    computer  program  computed 
the  following  overlap  table.  Table  5,  by  usi'ig  an  appropriate 
chi-square  score. 

TAPLF  5.   THE  OVERLAP  BETV:EEN  GROUPS  IN  TEST  1. 


Failed  Nonfailcd 


Failed  100.00  0.00 

Nonf ailed       0 . 4939302F-23         100.00 
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This  tabic  shows  that  ^ibout  zero  percent,  or  rnore  exactly 
0 .  4939302E-23  p(:>rcont  of  obsorvations  of  the  failed  group 
can  be  expected  to  lie  farther  from  the  center  of  the  failed 
group  than  the  moan  of  the  nonf ailed  qrcjup.   Sirailarly,  it 
is  expected  that  zero  percent  cf  the  observations  of  the 
nonf ailed  group  lie  farther  from  their  own  mean  than  the 
mean  of  the  failed  group.   Thus,  there  is  in  fact  no  overlap 
betv/een  the  groups  in  the  test  space,  given  unequal  disper- 
sion matrices. 

Validity  Tests 

To  see  how  valid  are  the  inferences  m.ade  by  the 
above  analysis  as  well  as  to  find  out  how  well  the  model  is 
useful  for  prediction,  two  tests  v;ere  conducted  i  one  with 
the  original  sample,  and  another  with  the  holdout  sample. 
Both  tests  were  also  run  once  with  a  quadratic  classi.fica- 
tion  method  and  once  with  a  linear  classification  procedure. 

Although  the  proper  procedure  is  the  quadratic  one, 
according  to  the  result  of  the  H   test,  inclusion  of  the 
linear  method  seems  to  be  appropriate  for  the  following 
reasons : 

(1)  Comparing  to  the  quadratic  equation,  the  linear 
equation  is  very  simple  and  helpful  in  understanding  the 
procedure. 

(2)  Examination  of  both  methods  is  useful  in  compar- 
ing the  results  which  are  obtainable  from  each  method. 
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(3)  E'or  the  present  case,     the  linear  method  seems 
to  be  as  v.ilid  as  the  quadratic  one.  This  is,  of  course,  due 
to  the  existence  of  large  distances  between  group  means  and 
lack  of  any  sionificaut  overlap  betv;een  the  groups. 

The  equations  used  to  run  the  validity  tests  are 
seen  in  the  next  page.   The  linear  equation  is  based  on 
equation  (18)  of  Chapter  III  and  the  quadratic  one  on  the 
equation  (35)  of  the  same  chapter.   To  construct  these  equa- 
tions there  was  a  need  to  make  some  assumptions  with  respect 
to  the  a  priori  probabilities  of  drawing  an  observation  from 
the  populations  of  failed  and  nonf ailed  firm.s,  and  the  cost 
of  misclassifying  an  observation  from  the  failed  population 
as  from  the  nonfailed  and  that  of  misclassifying  an  observa- 
tion from  the  nonfailed  population  as  from  the  failed.   Tlie 
effect  of  these  assumptions  comes  through  the  cut-off  point 
v/hich  is  adjustable  at  any  time,  and  does  not  seem  to  be  as 
crucial  as  other  features  of  the  technique. 

Regarding  the  a  priori  probabilities,  it  is 
desirable  to  use  the  ratio  of  the  total  number  of  the 
individual  firms  in  both  populations  as  a  proxy  to  the  a 
priori  probabilities.   If  this  method  is  not  practical, 
similar  results  can  be  obtained  by  calculating  the  same 
ratio  from  a  random,  sam.pling  procedure.   Neither  of  these 
methods  was  possible,  however,  for  the  present  study,  bo- 
cause,  as  was  mentioned  previously,    no  comprehensive 
listing  is  available  which  includes  the  names  of  those  firms 


203 


which  have  vc].unt-.r  i  ly  liquidated  or  gone  out  of  business 
v;ith  no  custon^ors'  interest '  i  nvolvcd  . 

The  assuraption  rr.ade  here  is  the  equality  of  the  a 
priori  probabilities  ratio  with  the  ratio  of  the  size  of 
each  group  in    the  classif ica tory  sanple.  For  the  present 
case,  v;here  uliu  group  sizes  are  equal,  this  assumption  is 
equivalent  to  assuming  equal  a  priori  probabilities.   The 
reason  for  such  clioice  is  to  preserve  the  mere  chance  ratio 
which  actually  exists  if  an  observation  is  drav/n  randomly 
from  the  classif icato]-y  J^ample.  3y  doing  this,  the  effect 
of  ir.correct  estimation  of  the  a  priori  probabj.lity  is 
minimized  and  the  validity  of  other  aspects  of  the  model  is 
better  revealed. 

With  respect  to  the  cost  of  misclassif ication  in 
each  group,  there  is  no  firm  foundation,  for  the  present 
case,  upon  v/hich  a  judgment  can  be  based.   It  carnot  be 
determined  with  certainty  from  whose  point  of  view  the  costs 
should  be  considered.   Should  it  be  considered  from  the 
governm.ent '  s  point  of  view,  or  should  the  views  of  any 
other  interested  party,  sucli  as  the  firm,  creditors,  cus- 
tomers, employees,  investors,  other  firms,  or  the  analyst, 
be  taken  into  account?   Even  if  a  decision  can  be  made  in 
this  regard,  the  problem,  is  still  too  complicated  to  be 
solved.   From  the  viewpoint  of  the  government,  for  example, 
is  it  more  equitable  to  let  any  ailing  firm  continue  its 
existence  as  long  as  possible  and  violate  the  principle  of 


the  allocation  of  scarce  resources  by  incorrectly  classify- 
ing the  firm  into  the  healthy  group;  or,  it  is  more  fair  to 
disrupt  a  viable  going  concern  and  endanger  the  principle 
of  the  free  market  enterjrrises  by  makir-ig  an  erroneous  deci- 
sion in  putting  the  firm  in  a  failed  group?   Even  if  a 
decision  can  be  made  in  this  regard,  hov:  can  the  associated 
costs  be  d-ftermined?   Arid,  in  any  v;ay,  the  issue  is  mostly 
a  subjective  matter  in  the  present  case. 

Due  to  the  above  difficulties,  the  costs  of  the 
misclassif ication  in  both  groups  of  firmis  are  assumed  to 

be  equal  here. 
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Classification  of  the  original  sample.     Table  6 

following  shows  results  of  reclassifying  the  observations 
in  the  original  sam.ple  by  both  quadratic  and  linear  pro- 
cedures.  The  classification  is  performed  by  calculating 
the  discrim.inant  score  of  each  observation  and  comparing 
the  result  with  the  specified  cut-off  point.   The  scores 

are  calculated  by  subtracting  x'  from  x  "^  which  are 

2         1 

obtained  by  applying  an  appropriate  procedure  (linear  or 
quadratic).   The  result  is  then  compared  with  0.0  for  the 
linear  equation,  and  -0.88367  for  the  quadratic  equation 
and  classification  is  dene  according  to  the  rules  stated 
under  the  equations.   The  discriminant  scores  calculated, 
and  the  group  assignment  made,  by  the  quadratic  procedure 
have  been  included  in  Appendices  A  and  B  against  each  firm's 


EQUATIONS    USED    FOR    CLASSIFICATIONS    IN    TEST    I 


LiriKAR    A'JD   nUADRATIC    EOUATIONS    IN    TEST    SPACE 
1.       LI-^IEAR    EQUATION    U  (FLD,    NFLD)     = 


VAKIABLE    :<-5 
-0.2  394  7 


VAP'^BLE    X-5  2 
l'.6.10 


VARIABLE    X-6 
-0.22518 


VARIABLE    X-58 
-1.4526 


VARIABLE    X-12 
-9.2115 


VARIABLE    X-69 
62.861 


VARIABLE  X-15 
-0.32805 


VARIABLE  X-78 
0.29287E-01 


VARIABLE  X-18 
4.8457 


VARIABLE  X-81 
0.26021E-01 


VARIABLE  X-22 
70.661 


VARIABLE  X-45 
3.7686 


CONSTANT  1  =  15.29  8 
LOG  OF  P  =  0.0 
2.   OUADRATIC  F:0UAT:0N  II  (FI.D,  NFLD)  = 


Lr!;':AR  ti:pms 

VARIABLE 

X-5 

VARIABLE    X-6 

VARIABLE 

X-12 

VARIABLE    X- 

-15 

VARIABLE    X- 

-18 

VARIABLE    X- 

■22 

X-5 

-l.'.R81 

0.29267r; 

;-oi 

X-6 

-0.')(5394 

0.44991E 

:-02 

0.13724E- 

-01 

X-12 

-18. '.03 

0.52927 

0.47605 

-32.369 

X-15 

0. -1866211 

07 

3.3063 

-0.63285 

38.461 

-0.24332E 

07 

X-18 

0.  186631: 

07 

3.7207 

-0.88441 

21.789 

-0.48666C 

07 

-0.24333n 

07 

X-22 

3018.9 

80.090 

5.5341 

57.020 

-10687. 

-9790.4 

-0.48706E 

0  6 

X-4  5 

42.771 

0.40866 

-0.29114 

27.578 

-75,549 

-83.916 

-659.66 

X-52 

-934.28 

-27.785 

-3.5431 

-165.17 

3286.0 

3892.  3 

0.1113-lE 

06 

X-58 

-30.123 

-0.23955 

0.16740 

-0.57615E- 

■01 

30.339 

35.946 

838.10 

X-69 

-211 .71 

-4.9881 

-0.89518 

-76.344 

447.63 

593.47 

-1263.4 

X-7H 

0  . ')  1  4  5 1 

-0.18065R- 

01 

-0.22220E- 

-01 

-0.71876 

2.0730 

1.  1007 

-3. 1581 

X-81 

-0.70065 

0.1682  3E- 

01 

0.20408E- 

-01 

1.2078 

0.45297 

0.67241E- 

-02 

5.8633 

(SEC' I  ON  2) 

VARIABLE  X-45    VARIABLE  X-52    VARIABLE  X-58    VARIABLE  X-69    VARIABLE  X-78    VARIABLE  X-81 


X-45 

-24.733 

X-52 

543.42 

-57278. 

X-58 

4.1357 

-277.19 

-0.17988 

X-6  9 

115.13 

-2110.3 

-45.416 

-436.69 

X-78 

-0.22507:^-01 

-0.36337 

-0.18460 

1.3013 

0.65116E-01 

X-81 

-0.14948 

5.0863 

-0.19329E-01 

-1.1229 

-0.36298E-01 

0.51666E-01 

2  LOG  OF  P  =  0.0 

CO:;STANT  1  =  -0.24  330E  07 

LOG  OF  CONSTANT  2  =  -0.88367 

P  =  Till:  RATIO  OF  APHIORI  PROBABILITIES  OF  GROUP  J  AND  GROUP  I  AND/OR  COSTS  OF  GROUP  MEMBERSHIP 

FOR  OUADRATIC  EQUATION  CONSTANT  2  =  DETERMINANT  (DISPERSION  MATRIX  OF  GROUP  J) /DETERMINANT  (DISPERSION  MATRIX  OF  GROUP  I) 

OUADPATIC  EQUATIONS  U(I,J)  ARE  READ  AS  A  HALF  MATRIX  OF  COEFFICIENTS  DEFINED  BY  THE  PRODUCT  OF  THE  ROW  AND  COLUMN 
TflRM::.   PLUS  LINEAR  TERMS  STORED  IN  THE  FIRST  COLUMN.   PLUS  CONSTANT  1. 

QUADl-ATIC  CEA^iSIFICATION  RULES  ARE  AS  FOLLOWS  — 

ASSIi::i  TO  GROI^I^  I  IF  FOR  ALL  OTHER  GROUPS  J,  THE  EQUATION  U(I,J)  IS  LESS  THAN  THE  NATURAL  LOG  OF  COrJSTANT  2  MINUS  2 

TIME.'.  THE  NATU[<AL  LOG  OF  P 

LINE.-.R  EOUATIOrjS  ARE  READ  AS  A  VECTOR  OF  COEFFICIENTS  PLUS  CONSTANT  1 

LI'Jr.AR  CLASSIFrCATIflN  RULES  ARE  AS  FOLLOWS  — 

ASSir.N  TO  GROUP  I  II'  FOR  ALL  OTHER  GROUPS  J,  THE  EQUATION  U(I,J)  IS  GREATER  THAN  THE  NATURAL  LOG  OF  P 
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As  socn  in  the  table,  the  linear  equation  has  heon 
successful  in  roclass i f yinq  the  oriqinal  observations  100 
percent  correctly,  v;!ioreas  tlio  quadratic  equation  has  mis- 
classified  one  oDservation  frorv-    the  nonfailod  group  into 

"  c 

the  failed  croup.     Thus,  from  the  total  number  of  60 
observations  only  one  firm,  or  1.67  percent,  has  been  mis- 
classified.   The  misclassi f ication  rate  of  the  NASD  study 
for  the  original  sample,  using  cjuadratic  method,  was  9.9 
percent.   As  seen  in  the  table,  the  cjuadratic  procedure 
showed  a  slight  incl j.nat.ion  to'^ard  the  failed  group.   If  it 
was  large,  dt  could  be  corrected  by  adjusting  the  cut-off 
point  in  such  a  way  that  the  expected  rate  of  misclassi-f ica- 
tion becomes  equal  in  both  groups. 

To  provide  a  vi.sual  representation  of  the  quadratic 
classification  procedure,  a  one-dimensional  graph  has  been 
prepared  which  maps  the  classified  observations  in  accord- 
ance with  their  discriminant  scores  (see  Chart  I) .   The 
observations  graphed  are  letter-coded  according  to  group 
membership.   Letter  F  designates  observations  from  the 
failed  group  (or  group  1) ,  and  letter  N  those  from  the 
nonfailecl  group  (or  group  2). 

Classif icatiori  of  the  holdout  .sample.   As  discussed 
in  Chapter  III,  the  results  of  the  classification  of  the 
original  sample  are  biased  upward.   An  independent  holdout 
sample  provides  a  better  means  for  evaluating  the  dis- 
criminatory or  predictive  pov;er  of  the  model.   Just  like 
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F  indicates  an  observation  from  the  failed  group. 

N  indicates  an  observation  from  the  nonf ailed  group, 

Scale  for  the  graph:   one* =5. 

6  Observations  were  off  the  graph  to  the  left. 

6  Observations  wore  off  the  graph  to  the  right. 


CHART  I.   CLASSIFICATION  OF  THE  ORIGllIAL  SAI'TLE  OF  TEST  I 
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th.G  original  sample,  the  observations  of  this  sample  have 
been  classified  by  using  boLh  linear  and  quadratic  pro- 
cedures.  Tl;  ?  individual  discriminant  scores  of  the.  quad- 
ratic procedure,  along  with  its  group  assignments,  have  been 
included  in  Appendices  C  and  D.   Ti:ie  overall  results  of  the 
classification  are  also  shown  in  Table  7. 

As  is  seen  in  the  table,  the  quadratic  equation  has 
misclassif red  four  firms  from  failed  and  four  firms  from 
nonf ailed  groups.   TJie  linear  equation  has  misclassif ied 
one  observation  less.   Although  the  results  of  the  tv;o 
procedures  are  comparable,  the  firms  misclassi f ied  are  not 
exactly  the  same.   Both  procedures  have  misclassif ied  firms 
No.  21,  26,  27,  and  29  of  the  failed  group.   But,  the 
quadratic  m.ethod  has  misclassif  ied  firms  No.  42,  43,  55, 
and  58  fromi  noufailed  group,  whereas  the  linear  procedure 
has  misclassif ied  firms  No.  36,  42,  and  43.   As  seen  in 
Appendix  C,  two  of  the  misclassif ied  failed  firms  (Nos.  21 
and  27)  have  a  lov7  age,  a  high  last  report  lag  time,  and 
only  tv7o  reports.   Other  firms,  hov/ever,  do  not  show 
obvious  reasons. 

The  quadratic  method  has,  on  average,  misclassif ied 
13.3  percent  of  the  holdout  sample,  and  the  linear  method 
11.6  percent.   As  discussed  before,  the  similarity  between 
the  results  of  the  two  methods  is  due  to  the  large  distance 
between  the  m.eans  and  virtUvally  no  overlap  between  the  two 
groups. 
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ThG  results  of  Lho  quadratic  classification  of  the 
holdout  sample  have  also  ijoen  fjraphed  in  the  same  way  as 
that  for  the  original  saraple  (see  Chart  2). 

I'oten  t  i  a  1  Biases 

There  are  several  possible  sources  of  potential 
biases  in  the  results  obtained.   These  potential  biases 
might  have  been  worked  in  either  upv/ard  or  dov7nv;ard  direc- 
tions: ascribing  a  higher  degree  of  accuracy  to  the  model 
than  what  it  actually  possesses,  or  attributing  a  lower 
level  of  validity  to  the  results  than  what  they  really 
deserve.   The  following  paragraphs  discuss  where  these 
biases  might  have  come  from,,  and  v.'herever  possible,  test 
the  significance  of  their  effects  on  the  resu.lts. 

Potential  biases  due  to  the  sample.   As  v/as  reported 
earlier  in  this  chapter,  it  has  not  been  possible  to  include 
any  non-SIPC  failed  firm  in  the  study.   This  might  have 
introduced  some  biases  into  the  model,   Furtherm.ore ,  the 
limitations  existed  and  the  considerations  made  in  the 
process  of  sample  selection,  as  reported,  might  have  brought 
in  the  effects  of  deviation  from  randomi^.ess . 

To  estimate  the  extent  of  such  biases,  a  test  was 
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run  by  using  the  scrambled  sample  m.ethod.     Although  this 

test  is  more  suitable  for  samples  of  small  sizes,  where  the 
possibility  of  using  the  holdout  sample  method,  as  performed 
above,  is  nil,  its  application  to  the  present  study  seems 
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CHART  II.   CLASSIFICATION  OF  THE  HOLDOUT  S.AMPLE  OF  TEST  I 
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appropriate,  due  to  the  existoncG  of  the  conditions 
described  above,  and  as  an  additional  means  of  checking  the 
validity  of  the  analysis. 

To  run  this  test,  all  the  120  observations  were 
scraip.bled  and  each  of  the  individual  observations  •,vas  as- 
signed to  one  of  the  two  groups  randomly.   Then,  follov;ing 
the  same  methodology  used  to  get  the  original  function,  the 
discriminant  function  for  this  scrambled  sample  v/as  ob- 
tained. "    By  using  thi.s  function  the  entire  group  of 
observations  was  classified.   The  results  of  both  linear 
and  quadratic  procedures  were  47.5  percent  misclassif ica- 
tion. 

Since  the  observations  used  in  computing  the  func- 
tion were  assigned  to  the  groups  randomly,  it  was  expected 
that  the  discrim.inatory  power  of  the  model  would  be  zero. 
Equivalently ,  it  was  expected  that,  by  pure  chance,  50 
percent  of  the  observatic^ns  are  classified  incorrectly. 

To  determine  the  significance  of  the  difference 
between  the  actual  result,  4  7.5  percent,  and  the  expecta- 
tion from  pure:  chance,  50  percent,  a  t  test  was  conducted 
by  using  equation  (70)  of  Chapter  III,  with  the  following 
results. 

t  -  (Q-P)/a   =  (0.475  -  0  .  500 ) /O . 04 5643 1  =  -0.55 
The  calculated  value  of  t  indicates  that  the  difference 
between  the  results  of  the  model  and  chance  is  not  sig- 
nificant at  any  level  lower  th>an  0.6  (the  tabulated  value 
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of  t  at  the  0 .  (:■  level  is  0.524).   In  other  words,  if  there 
has  r)ec-n  any  sar:plc  bias  m  tlie  n^odel  ,  it  has  not  most 
probably  been  significant. 

As  a  raatter  of  possible  interest,  the  result  of  a 
slightly  different  approach  to  the  problem  is  also  reported. 
This  approach  is  based  on  the  logic  of  the  scrambled  sample 
method,  but  in  a  reversed  order.   Rather  than  scrambling 
the  observations  first  and  deriving  the  function,  and  then 
applying  the  function  to  the  observations  in  their  actual 
groupings,  the  modified  approach  derives  the  function  first 
from  the  observations  in  their  actual  groupings,  and  then 
applies  the  function  to  the  scrambled  observations.   If 
the  model  does  not  have  any  sample  biases  it  should  not  be 
able  to  classify  the  scrambled  observations  any  better  than 
by  pure  chance.   The  advantage  of  this  modified  approach  to 
the  original  one  is  the  elimination  of  the  costs  associated 
with  deriving  the  function  from  the  scrambled  observations, 
because  rhe  other  function  would  have  to  be  computed  anyway. 

As  was  expected,  the  result  of  the  modified  approach 
was  al.nost  exactly  the  same  as  that  of  the  original  one. 
That  is,  the  12-\'ariable  function  derived  from  the  observa- 
tions in  their  actual  groupings  miisclassif ied  46.7  percent 

of  the  scrambled  observations  with  the  quadratic  procedure 

21 
and  50  percent  with  the  linear  one.     The  t  value  for  the 

quadratic  method  is  -0.72  which  is  again  not  significant  at 

any  level  lovv'er  than  about  0.5. 
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Potential  biases  duo  to  data.   In  the  section  on 
"Data  Sot,"  it  v.'as  roportod  that  tho  study  had  run  into  tho 
difficulty  of  niissing  data  duo  to  the  existence  of  several 
factors  named  therein.   It  was  also  stated  that  in  solving 
this  problem  the  justified  method  of  mean  substitution  was 

utilized.   Although  the  superiority  of  this  method  (and 
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principal  component  method)  has  been  shown  experimentally, 

the  exi.stence  of  some  biases  caused  by  this  factor  cannot 
be  denied. 

To  estimate  the  extent  of  these  biases  it  is 
desirable  to  derive  a  discriminating  model  which  docs  not 
contain  any  m.issing  values  among  its  data  set. 

If  all  tiie  firms  having  one  or  more  missing  values 
in  their  data  sets  are  om.itted,  the  total  number  of  firms 
which  rem.ains  from  the  original  GO  failed  firms  would  be 
19.   The  remaining  number  from  the  original  60  nonf ailed 
firms  would  be  39.   If  all  the  data  item.s  having  one  or 
more  m.issing  values  in  their  related  data  sets  are  omitted, 
the  total  number  of  omitted  variables  would  be  31.   If, 
however,  the  tv;o  methods  are  coniprcmised  by  om.itting  some  of 
the  data  items  in  which  the  mis^^ing  values  most  frequently 
occurred  and  then  eliminating  the  firm^s  with  missing  values 
in  the  remaining  data  item.s,  a  better  result  may  be  obtained 

Following  th.is  procedure,  eight  data  items  were 

eliminated  first,  resulting  in  the  elimination  of  11 

2  3 
variables.  '   In  fact,  the  number  of  variables  omitted  in 
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this  reaard  would  be  six,  becaure  the  other  five  variables 

would  be  eiiiniiia'ced  due  to  the  correlations,  ap.yv;ay.   But, 

24 
from  these  six  variables,    tv/o ,  X-2  2  and  X-7  8,  are  ones 

which  have  been  among  the  selected  variables  in  Test  1. 

Then,  all  the  firms  having  one  or  more  missing 

values  j.n  tl;e  remaining  data  iteras  were    omitted.   This  kept 

2S 
22  failed  firms  and  10  nonfailed  firms    out  of  the  analy- 
sis. 

With  the  remaining  38  failed  firms,  50  nonfailed 

firms,  and  55  variables,  and  using  a  similar  m.ethodology 

2  f 
as  the  one  used  in  Test  1,    a  set  of  12  variables  was 

27  .  ,    .        . 

selected    and  the  related  linear  and  quaarata c  equations 

were  derived  (see  /appendix  J)  . 

As  seen  in  Appendix  I,  half  of  the  selected  variables 
are  the  same  variables  as  in  Tost  1.   It  may  also  be  of 
interest  to  note  that  7  of  the  12  variables  are  either  in 
the  form  of  the  coefficient  of  variation  or  the  standardized 
deviation  from  a  disaster  level. 

The  signs  of  the  discriminant  coefficients  of  the 
six  comjnon  variables  and  their  economic  analysis  are  con- 
sistent with  those  of  Test  1.   The  coefficients  of  tlie 
other  six  variables  are  also  congruent  'with  what  has  been 
expected.   That  is,  the  average  failed  firm  has  shown  lower 
level  of  retained  earnings,  higher  level  of  cash  and  de- 
posits, lower  level  of  equity  capital  plus  subordinated 
debt,  higher  level  of  failed  to  receive,  larger  varia- 
tion in  failed  to  deliver,  and  larger  variation 
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in  total  liciDi],  ;i  I  ic-s  to  equity  ca£'ital  ratio  than  those  of 
its  nonf ailed  counterpart. 

The  centroids  of  the  failed  and  nonf a.i  led  groups 
v;ere,  respectively,  0.54S59  and  1.04588,   The  result  of  the 
H,  test  of  dispersion  matrix  equality  was  5.83,  which  is 
significant  at  the  0.0  percent  level.   The  11   test  of 
equality  of  group  means  resulted  in  an  F  statistic  equal 
to  10.20  v;hich  is  significant  at  the  0.61846E-10  percent 
level.   And  the  expected  overlap  of  groups  in  the  test 
space  v/as  alruost  zero,  given  unequal  dispersion  matrices. 

The  quadratic  and  linear  classification  procedures 
respectively  reclassified  the  original  observations  v/ith 
11.36  percent  and  9.09  percent  rate  of  mj.scl  assif  ication 
(see  Appendix  K) .   These  compare  to  1.67  percent  and  0.0 
percent  in  Test  1,  and  to  9 . 9  percent  in  the  NASD  study 

for  the  quadratic  procedure.   The  Lachenbruch  classifica- 

2  fl 
tion  method,    hov/ever,  classified  the  observations  with 

15.91  percent  misclassif ication  by  the  quadratic  and  10.23 
percent  by  the  linear  procedure  (see  Appendix  L) .   The 
corresponding  value  of  the  quadratic  procedure  in  the 
NASD  study  v.'as  13.8  percent. 

As  was  noted  above,  the  results  of  the  classifica- 
tion of  the  reduced  sample  which  does  not  contain  any  mis- 
sing data  are  not  any  better  than  those  obtained  in  Test 
1.   In  other  words,  if  any  bias  has  been  introduced  into 
the  m.odel  by  using  the  mean  substitution  ip.cthod  in 
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treatino  the  missing  data,  this  bias  has  probably  not  been 
signif  ican-c. 

Potential  biases  due  to  the  va riatio n_ i n  reporting 
?At±?.-   I"  the  section  on  "Source  of  Data,"  it  was  stated 
that  the  time  period  elapsed  betV'/een  two  financial  reports 
of  the  firms  ranged  from  5  to  21  month.s.   It  was    thought 
that  this  variation  might  have  some  negative  effects  on  the 
comparability  of  data  both  for  the  same  firm  during  differ- 
ent years  and  for  different  firms  during  the  sam.e  year. 

An  effort  v/as  made  to  remove  this  variation  by 
"normalizing"  the  deviated  cases.   Accordinqly,  the  follov;- 
ing  formula  v/as  used  to  assign  nev,'  values  to  the  data  items 
of  the  same  firm,  in  i.ts  reports  of  the  years  prior  co  the 
last  one,  taking  the  last  report  as  the  base. 

H-1    -^^      ETP      +  ^t' 

where  t  is  the  base  year,  Y^  is  the  new  value  of  the  data 
item  in  the  year  prior  to  the  base  year,  X^  is  the  value  of 
the  item  in  the  base  year,  X^_^  is  the  old  value  of  the  item 
in  the  year  prior  to  the  base  year,  and  ETP  Is    the  elapsed 
time  period  betv/een  the  two  reporting  dates. 

The  fraction  on  the  right  side  of  the  above  equation 
computes  the  average  change  of  the  value  of  the  data  item 
during  one  month.   When  this  change  is  multiplied  by  12,  the 
total  change  during  12  months  is  obtained.   By  adding  this 
total  change  to  the  base  value,  X. ,  the  new  value  of  the 
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item  results.   This  proccuuro  is  ropoated  for  all  the  data 
itens  in  the  report  until  th'.^  entire  report  of  the  previous 
year  is  "nor;na]  izcd .  "   Then,  the  norra?,lized  report  is  taken 
as  the  base  and  the  data  items  in  the  report  prior  to  the 
normalized  one  are  treated  in  the  same  way. 

Using  these  normalized  data,  the  entire  process  of 

29 

Test  I  was  repeated,  and  12  variables  wore  selected.     From 

am.ong  these  variables  8  are  the  same  as  those  selected  in 
the  original  experiment  and  two  of  them,  X-41  and  X-G8,  are 
similar  to  X-69  and  X-22,  respectively,  which  were  selected 
in  the  original  model  to  deal  with  trading  and  investm.ent 
accounts  and  partly  secured  and  unsecured  customers' 
securities  accounts.   The  remaining  two  variables  are  X-28 
and  X-73,  dealing  with  failed  to  deliver  outstanding  30 
days  or  more  and  cash  accounts,  respectively. 

The  coefficients  of  both  X-28  and  X-73  in  the 
related  discriminant  function  are  negative.   This  signifies 
that  the  higher  values  of  these  variables  are  associated 
with  the  failed  firms.   In  other  words,  the  failed  firms, 
on  average,  have  shown  a  greater  percentage  rate  (with 
respect  to  total  assets)  of  securities  failed  to  deliver 
30  days  or  more  than  the  nonfailed  firms  (this  is  not 
contrary  to  what  is  usually  expected) .   They  have  also 
dem>onstrated  a  higher  percentage  of  cash  to  total  assets 
than  their  nonfailed  counterparts.   This  sets  one  to 
thin};ing  that  the  failed  firms  may  not  have  utilized  their 
available  cash  supplies  appropriately  and  have  kept  them 
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idle  in  oxcesp  of  their  clay  to  day  requir;?n-:onts . 

When  the   validity  of  the  "norir.alizcd"  niodol  was 
examined  by  the  original  san|;le  method  and  the  holdout 
sample  method,  it  was  found  out  that  the  model  did  not 
perform  any  better  than  the  ori.ginal  one.   In  fact,  the 
reverse  vjas  true.   The  results  of  these  tests  are  shown  in 
Appendices  M  and  N. 

As  is  seen  in  these  appendices,   the  quadratic 

30 
procedure    of  th,e  normalized  model  classifitd  96.7  percent 

of  the  original  and  80  percent  of  the  holdout  sample 

correctly.   These  compare,  respectively,  with  those  of 

98.3  percent  and  86.7  percent  correct  classification  in  the 

original  model.   Although  the  result  of  the  riorm.alized 

31 
mcodel  is  also  encourfiging ,    its  perf  ormiance  ,,  in  terms  of 

the  rate  of  m.isclassif ication  of  the  holdout   sample,  is 

not  as  good  as  that  of  the  original  m.odel .   This,  of  course, 

is  not  V7hat  was  expected. 

The  lower  discriminatory  power  of  the  norm.alized 
model  may  lead  one  to  the  conclusion  that  no  bias  exists 
due  to  the  variation  in  reporting  dates.   This,  however, 
may  be  a  misinterpretation  of  the  results.   A  possible 
factor  responsible  for  the  result  is  the  procedure  used  for 
normalizing  the  data  items. 

The  implied  assumption  in  the  above  procedure  is 
that  the  changes  in  the  data  item,  from  the  previous  year  to 
tlie  next  is  m.onotone,   and,   therefore,  the  principle  pf 
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proportionality  holds.   Although  justifiod  in  this  situation 
where  there  is  no   alternative,  thjs  assumption  may  be  a 
little  weak. 

Potential  biases  due  to  the  variable  selection.   This 
point  was  also  mentioned  during  the  description  of  the  metho- 
dology.  The  large  nuinber  of  variables  in  relation  to  the 
small  number  of  available  observations  necessitated  that  the 
selection  process  be  mingled  with  some  heuristic  procedure. 
The  result  was,  of  course,  the  construction  of  a  model  which 
can  not  be  guaranteed  to  be  the  best  one,  although  optimal 
under  the  existing  set  of  constraints.   The   possible  bias 
in  this  respect  is,  therefore,  on  the  negative  direction. 
That  is,  if  it  has  not  been  subjected  to  the  restraints, 
especially  with  respect  to  the  number  of  observations,  the 
model  might  have  produced  a  better  result.   The  limitation 
in  the  number  of  reports  on  each  firm  may  also  be  a  source 
of  this  kind  of  bias.   This  problem  prevents  the  m.eans  and 
standard  deviations  of  several  data  items,  on  which  m.any 
variables  have  been  based,  from  becoming  fully  operative. 

Results 

The  efforts  directed  toward  building  a  MDA  m.odel  fcr 
analyzing  and  predicting  the  failure  of  the  securities 
brokerage/dealer  firms  from  about  a  year  in  advance  resulted 
in  a  discriminant  function  consisting  of  12  variables  which 
measure  the  profitability,  risk  exposure,  and  mismanagement 
of  the  firm.   The  model  separates  the  two  groups  of  failed 
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and  nonfailed  firrr'is  in  such  a  v/ay  that  thorc  is  virtually 
no  overlap  between  tiiem  and  the  distance  betv;een  the  group 
moans  is  highly  significant. 

The  model  vvas  very  helpful  ii'i  describing  the  attri- 
butes of  the  failed  and  nonf ailed  firms.   The  sample  firms 
showed  the  association  of  ].ower  ages  of  a  firm  and  its 
president  with  failure.   They  also  demonstrated  that  non- 
failed  firms,  on  average,  have  relatively  greater  retained 
earnings  to  total  assets  and  larger  equity  capital  plus 
subordinated  debt  to  total  assets.   The  interest  rates  in 
the  prior  year  have  also  had  a  positive  effect  on  the  non- 
failed  firms.   The  relations  betvjeen  failure  and  percentage 
changes  in  equity  capital,  percent  of  the  short  positions 
in  the  trading  and  investment  accounts,  and  the  coefficient 
of  variation  of  the  ratio  of  the  trading  and  investm.ont 
accounts  to  total  assets  were  shov.-n  to  be  positive.   This 
direct  relation  also  exists  for  variables  dealing  with 
partly  secured  and  unsecured  customers'  securities  accounts, 
total  liabilities  minus  subordinated  debt,  and  property 
and  fixtures  management. 

Assuming  tnat  the  classification  of  the  selected 
variables  into  the  throe  elem.ents  of  the  hypothesized 
function,  i.e.,  profitability,  ri£-,T';,  and  mism>anagem>ent ,  is 
correct,  the  profitability  variables  account  for  33.66 

3  9 

percent   of   the   discriminatory   power   of   the   riodel.  The 

contributions    of    risk   and   raismanagoment   variables    are, 
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respectively,  52,6"  percent  and  13.70  percent.   This 
indicates  that  in    the  short  run  risl:  element  is  the  most 
significant  deter'ni  nant  of  the  firm's  survival.   The 
importance  of  profitability  is  next  and  that  of  the  mis-- 
management  is  last. 

The  accuracy  and  validity  of  the  model  were 
examined  by  several  methods.   The  very  low  percentage 
(only  1.67  percent)  of  the  misclassif ication  of  the  original 
sample  indicated  that  the  model  is  significantly  accurate. 
The  exposure  of  the  model  to  the  holdout  sample  was  used 
as  a  means  of  both  the  predictive  ability  and  the  validity 
of  the  model.   Its  discriminatory  power  was  shown  to  be 
relatively  high  by  correctly  classifying  about  86.7  percent 
of  tlie  holdout  sample.   Finally,  the  scrambled  m.ethod, 
the  Lachenbruch  method,  and  the  "normalized"  model  did  not 
indicate  that  the  potential  biases  in  the  model  are  signifi- 
cant . 

Test  II.   Tvjo  Years  -in  Advance  o f  Bankruptcy 

Introduction 

The  model  presented  in  the  previous  experiment  is 
the  outcome  of  the  effort  made  toward  analysing  and  predicting 
the  financial  position  of  a  brokerage/dealer  firmc,  based 
upon  the  inf  ormat  .i  on  obtainable  from  the  firm's  last 
financial  report  which   was   issued  about  a   year 
prior  to  bankruptcy.   The  natural  extension  of  such  endeavor 
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is  an  attempt  to  build  up  another  r.-.oiel  cap-ible  of  serving 
the  same  purpose  for  two  years  in  advance  of  a  potential 
bankruptcy.   That  j.s,  the  analysis  and  prediction  should 
be  based  on  information  obtainable  from  the  firm's  ne::t- 
to-the-last  financial  report. 

Altliough  liavj  ng  such  a  model  is  highly  desirable, 
the  expectation  from  the  model  is  usually  too  much  to  be 
materialized  for  at  least  the  next  fev^  years,  v;hen  the 
economy  may  become  m.ore  under  control.   The  reason  for 
inability  to  build  a  highly  successful  model  is  inherent 
in  the  nature  of  the  securities  business.   Unlike  miost 
other  industries,  the  securities  industry  is  more  subject 
to  changes  in  the  economy  and  the  danger  of  failure  is 
levered  by  the  very  high  financial  leverage  (recently  1  to 
12)  which  is  a  going  practice  in  that  industry.   A  firm 
v/hich  is  in  a  sound  position  today  may  be  defunct  tomorrow! 
Every  major  downward  change  in  the  prices  of  the  securities 
brings  down  with  it  several  brokerage/dealer  firms.   As  v/as 
mentioned  earlier  in  this  chapter,  of  109  SIPC  failed  firms 
about  4  0  percent  did  not  have  a  char-ce  to  survive  long 
enough  to  produce  even  two  financial  reports. 

With  this  caveat  in  mind,  it  is  not  expected  that  a 
model  using  the  financial  information  of  two  years  prior  to 
bankruptcy  would  be  highly  successful.   On  the  contrary,  it 
seems  reasonable  to  expect  that  every  bit  of  information 
obtainable  from  such  a  less-than-successf ul  m.odel  makes  the 
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effort  wor  thv.'hi  lo  .  This  st  atc-nion  t  is  particularly  true 
for  the  5-:;os£r.t  study  which  usos  only  raijou  t  half  of  the 
pertinent  financial  information,  because  the  other  half 
v/as  inaccessible. 

Variable  Set 

The  variable  set  used  in  this  experiment  is  a 
subset  of  that  used  in  Test  I.   The  variables  omitted  from 
the  original  set  are  those  incorporating  the  standard  devia- 
tion of  some  of  the  data  items.   Specifically,  variables  ex- 
cluded are  X-50,  X-51,  and  X-63  through  X-S2.  The  reason  for 

such  treatment  is  the  availability  of  only  tv/o  financial  re- 

3  3 

ports  for  22  of  the  60  failed  firms,    Vv'hich  leaves  only  one 

financial  report  of  such  firms  for  the  present  test.   For  the 

same  reason  variables  dealing  v/ith  the  changes  of  data  items 

from  one  year  to  another  were  also  eliminated.   Finally, 

a  number  of  additional  exclusions  of  variables  arose  from  the 

correlation  between  some  of  them.   In  all,  from  thie  original 

set  of  83  variables,  only  29  variables  remained  to  be  Indu- 
s' 
ded  in  the  present  test. 

Discriminating  Analysis 

The  analysis  presented  in  this  tost  is  also  based 
on  the  analytic  framev/ork  discussed  in  Chapter  II,  and  the 
MDA  technique  described  in  Chapter  III. 

Methodology.   The  methodology  used  here  is  the  same 
as  that  employed  in  Test  I.   The  only  difference  is  the  re- 
quired adjustm.ent  made  in  the  process  to  match  the  reduction 
in  the  number  of  variables. 
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The  ficloctod  variciblcs.   Tho  culinination  of  ujing 

3  set  of  29  variables  and  exposing  the  financial  data  of  30 

35 
failed  and  3  0  nonfoiled  fir^.s    of  the  original  sanple  to 

the  MDA  technique  was  the  selection  of  12  variables  whose 
names  and  pro::r:rties  appear  in  Table   8,     From  among 
these  variables  four  are  the  sam.e  ones  selected  in  Test  I, 
and  the  rest  have  some  similarity  to  those  of  the  previous 
test. 

To  facilitate  the  final  conclusion  Lo  be  drawn 
from  this  analysis,  it  is  useful  to  consider  each  of  these 
variables  in  its  place  in  one  of  the  three  categories 
corresponding  to  the  three  elements  in  the  hypothesized 
function.   Again,  in  some  cases,  there  is  no  clear-cut  line 
between  these  categories,  and  a  variable  may  appear  to 
belong  to  more  than  one  category.   But  it  is  desirable  to 
have  the  variable  in  the  category  where  its  discriminatory 
power  seems  to  be  overriding. 

Profitability.   Among  the  12  variables  there  are  3 
which  fall  in  this  category:   X-5,  X-11,  and  X-49.   X-5, 
the  age  of  the  firm,  has  previcus].y  been  discussed.   The 
relation  between  X-11,  the  ratio  of  retained  earnings  to 
equity  capital  plus  subordinated  debt,  and  profitability 
is  similar  to  X-12  which  v/as  discussed  in  Test  I.   And, 
X-49,  the  percentage  change  in  GNP  is  an  external  factor 
which  also  affects  the  profitability. 

Risk.   The  variables  assigned  to  this  category. are 
X-15,  X-17,  X-33,  X-41,  and  X-52.   X-15,  the  ratio  of  equity 
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capital  pj.us  subordinated  debt  to  total  assets,  and  X-52, 
the  percentagf^  of  tradinr;  and  investment  accounts  v/ith 
credit  balances,  have  a] ready  been  discussed.   X-17  is 
equival^ent  to  the  traditional  debt-to-equity  ratio  and 
requires  no  discussion.   X-41,  which  is  the  argument  of 
X--69,  has  been  covered  during  the  discussion  of  the  latter 
in  Test  I, 

With  respect  to  X--3  3,  which  is  the  ratio  of 
customers'  securit.i.es  secured  accounts  to  total  assets,  it 
seems  reasonable  to  expect  that  a  firm  vjhich  has  a  relatively 
higher  level  of  secured  accounts  bears  less  risk  than  the 
one  having  a  lov/er  level,  other  things  being  constant.   This 
variable,  however,  may  not  effectively  show  the  rel.ative 
risk.   A  better  measure  may  be  the  one  v/hich  sliows  the 
percentage  of  the  secured  account  in  relation  to  the 
accounts  receivable  from  customers  rather  than  to  total 
assets,  v;hich  is  the  case  with  X-33.   Creating  such  a 
variable,  however,  was  not  feasible,  because  some  observa- 
tions had  zero  accounts  receivable  from  custom.ers  and  this 
would  make  the  ratio  undefined. 

Mismanagement.   The  remaining  four  vi^riables,  i.e., 
X-6,  X-9,  X-29,  and  X-30,  fall  in  this  category.   X-6,  the 
age  of  the  firm's  president,  was  discussed  in  Test  I.   X-9, 
the  ratio  of  cash  plus  other  deposits  to  total  assets,  is 
similar  to  X-7  v;hich  was  covered  in  the  discussion  of  X-73 
in  the  Lachenbruch  test.   And  X-29,  the  ratio  of  failed  to 
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deliver  and  receive  over  30  days  tc  total  assets,  is 
similar  to  X-2B  v;hich  was  discussed  in  the  test  with 
"normalized"  data. 

Reaardina  X-30,  the  ratio  of  the  accounts  receiv- 
able from  customers  to  total  assets,  an  argument  can  be 
made  for  j.ts  inclusicm  among  the  profitability  factors 
since  the  easier  the  credit  is,  the  more  sales  result, 
and,  np  to  a  point,  the  more  profit  is  earned.   But,  in 
the  securities  business,  where  extraordinary  credit  con- 
cession does  not  seem  to  be  necessary  in  stimulating  sales, 
a  relatively  large  ratio  of  accounts  receivable  from 
customers  to  total  assets  may  signify  the  inability  of  the 
firm  to  collect  its  receivables  in  time. 

H,  test  of  equality  of  dispersion  matri.ces.   As 
before,  this  test  v/as  run  to  determine  whether  the  linear 
or  the  quadratic  procedure  is  the  appropriate  method  to 
be  used  in  constructing  the  discriminant  function  and  the 
related  classification  rule.   The  result  of  the  test,  which 
is  based  on  an  F  statistic  v;ith  first  degree  of  freedom 
equal  to  78  and  second  degree  of  freedom  equal  to  10622.8, 
shows  a  value  of  2.85  for  F  (significant  at  the  0.78597E-13 
percent  level)  indicating  that  the  dispersion  matrices  are 
not  equal  and  the  proper  procedure  is  the  quadratic  one. 

Ti.e  discriminant  function  and  its  analysis.   The  12 
variables  just  explained  were  used  by  the  computer  program 
to  build  up  the  linear  and  quadratic  functions.   Following 
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the  same  procedure  used  in  Test  I,  the  inferential  use  of 
the  MDA  technique  for  the  present  test  is  also  made  via 
the  linear  function.   The  quadratic  procedure  will  be  used, 
however,  for  the  predictive  purposes.   The  Linear  classifi- 
cation results  will  also  be  reported  for  additional  informa- 
tion and  comparison. 

Table   8   shows   the  coefficients  of  the  linear 
discriminant  function.   The  contents  of  this  table  match. 
those  of  Table  4  in  Test  I.   To  infer  the  role  and  the 
relative  importance  of  each  variable  in  the  function,  the 
sign  of  its  coefficient  and  the  percentage  of  its  discrimi- 
natory power  is  analyzed. 

To  begin  with,  it  should  be  noted  that  the  discrimi- 
nant mean  (or  centroid)  of  the  failed  group  in  reduced  space 
equals  -0.02  and  that  of  the  nonf ailed  group  equals  0.16. 
This  indicates  that  firms  having  larger  values  for  the 
variables  with  negative  coefficients  tend  to  make  lower 
discriminant  scores  and  fall  in  the  failed  group,  whereas 
the  reverse  is  true  for  the  nonf ailed  group. 

As  seen  in  Table   8,   the  properties  of  X-5  and 
X-6  in  this  test  are  almost  the  same  as  those  in  the 
previous  test.   Therefore,  there  is  no  need  for  further 
discussion  for  these  variables.   With  respect  to  X-9,  its 
coefficient  is  negative  indicating  that  the  failed  firms 
have  relatively  more  idle  cash  and  other  deposits  than  their 
nonf ailed  counterparts.   The  difference  between  the  means 
of  this  variable  in  the  two  groups,  although  small. 
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corroborates  this  indicatjon.   This  variable,  by  contri- 
buting about  12  percent  of  the  discriminatory  power  of  the 
function,  ranks  first  amor.g  other  variables.   It  is, 
however,  interesting  to  note  that  its  univariate  F  racio 
is  next  to  the  lowest  one  and  is  not  even  significant  at 
the  0.0  5  levr-l  . 

X-11  is  in  the  second  place.   Its  coefficient  is 
positive,  indicating  that  the  smaller  ratios  of  retain.ed 
earnings  to  equity  capital  and  subordinated  debt  arc 
associated  with  the  failed  firms.   The  smallness  may  come 
through  one  or  a  combination  of  three  sources:   smaller 
retained  earnings,  larger  equity  capital,  and  greater 
subordinated  debt.    The  negative  mean  value  of  the  variable 
in  the  failed  group,  hov;ever,  rem.oves  this  doubt  and 
indicates  that  retained  earnings  is  the  overriding  factor. 

X-15  and  X-52  show  similar  behaviors  in  this  test 
as  they  did  in  the  previous  test.   The  difference  is, 
however,  in  their  relative  importance  both  on  multivariate 
and  univariate  basis. 

The  negative  coefficient  and  other  properties  of 
X-17  which  is  equivalent  to  the  traditional   debt-to-equity 
ratio,  is  exactly  consistent  with  what  is  usually  expected 
from  such  a  variable.   It  is  both  significant  on  the 
univariate  basis  and  in  combination  with  other  variables. 
By  contributing  about  10  percent  of  the  discriminatory 
power  of  the  function,  it  ranks  four  among  other  variables. 
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X-29  q]so  has  a  negative  coefficient  indicating 
that  the  failed  firms  in  tl.e  sample  have  had,  on  average, 
relatively  larger  failed  to  deliver  and  failed  to  receive 
accounts  outstanding  30  days  or  more.   The  inability  of 
failed  firms  to  manage  their  accounts  in  timiC  is  a  factor 
which  is  usual ].y  expected. 

X-30  and  X-33  are  the  only  tv;o  variables  whose 
behavior  in  the  present  experiment  can  not  be  interpreted 
unambiguously.   The  possible  reasons  for  such  a  sitviation 
may  be  the  existence  of  a  relatively  high  correlation 
(0.72)  becv/een  them,  their  relatively  large  standard 
deviations,  and  the  flav\?  in  X-33  with  respect  co  the 
inappropriateness  of  its  denominator  as  discussed  in  the 
risk  classification  of  the  variables,  above. 

The  behavior  of  X-41  is  similar  to  X-52  as  discussed 
in  the  previous  test.   V/ith  its  negative  coefficient  it 
associates  failure  with  more  reliance  on  the  trading  and 
investnient  activities.   This  association  is  consistant  with 
common  sense,  if  the  variable  is  considered  as  a  measure 
of  risk.   But  the  mean  values  of  the  variable  in  the  two 
groups  do  not  corroborate  this  indication. 

Finally,  X-49,  shows  that  failure  has  an  inverse 
relation  with  percentage  changes  in  GNP.   The  greater  the 
change  in  GNP,  the  larger  becomes  th.e  probability  of 
failure.   As  shown  in  the  table,  the  failed  firnis  in  the 
sam>ple  have,  on  average,  seen  relatively  higher  changes  in 
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GNP  during  their  next  to  last  year  of  operation  than  their 
nonf ailed  counterparts. 

The  differences  between  the  group  means.   The  same 
two  statistical  tests,  Rao's  and  Hotelling's,  described 
in  the  previous  experiment,  were  utilized  in  the  present 
experiment.   Both  tests  yielded  a  value  of  17.31  for  the  F 
statistic  with  12  and  47  degrees  of  freedom,  which  is 
significant  at  the  0-23017E-10  percent  level.   The  results 
of  these  tests  indicate  that  the  failed  and  nonfailed 
groups,  as  differentiated  by  the  attributes  corresponding  to 
the  12  selected  variables,  are  sepaT/ate  from  each  other,  and 
the  distance  between  their  means  is  significantly  large. 

The  overlap  between  groups.   As  discussed  in  Test  I, 
the  following  table  can  be  used  to  show  the  extent  of  the 
group  overlap. 

TABLE  9  .   THE  OVERLAP  BETWEEN  GROUPS  IN  TEST  II 


Group  Failed        Nonfailed 

Failed  100.00         7.16542 

Nonfailed      0.3946710E-2       100.00 

Given  unequal  dispersion  matrices.  Table  9  shows  that  about 
zero  percent  of  the  observations  of  the  failed  group  can 
be  expected  to  lie  farther  from  the  failed  group  than  the 
mean  of  the   nonfailed  group.   Similarly,  it  is  expected 
that  about  7  percent  of  the  observations  of  the  nonfailed 
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group  ] ie  farther  from  thnir  own  mean  than  the  mean  of  the 
failed  group.  Thus,  there  is  some  overlap  between  the  two 
groups  in  the  test  space,  but  it  is  not  large. 

Validity  Tests 

In  Test  I,  a  relatively  detailed  discussion  v;as 
made  of  the  reasons  for  and  methods  of  conducting  validity 
tests.   This  discussion,  as  v.'ell  as  the  assumptions  made  in 
regard  to  the  a  priori  probabilities  and  misclassif i.cation 
costs,  holds  true  for  the  present  experiment.   Except 
where  otherwise  specified,  the  same  procedure  employed  in 
the  previous  test  is  followed  here. 

Classification  of  the  original  sample.   By  using  the 
computed  linear  and  quadratic  equations,  along  with  their 
related  classification  rules  (see  next  page) ,  the  observa- 
tions in  the  original  sample  v/ere  reclassified.   The  results 
of  this  classification,  v;hich  indicate  the  accuracy  of 
the  model,  are  presented  in  Table  1.0,  below.   This  table  shows 
that  the  quadratic  equation,  v/hich  is  m.ore  relevant  in  the 
present  case,  has  misclassif ied  only  two  observations  from 
the  failed  group  and  none  from  the  nonf ailed  group.   This 
is  equivalent  to  an  average  rate  of  misclassif ication  of 
about  3.33  percent.   This  rate  for  the  linear  equation  is 
1.67  percent. 

Classification  of  the  holdout  sam.ple.   Application 
of  the  linear  and  quadratic  equations  to  the  observations 
of  the  holdout  sample  in  this  test  did  not  turn  out  results 


EQUATIONS  USED  FOR  CLASSIFICATION  IN  TEST  II 


M.'JF:AR  and  quadratic  IOUATIONS  in  TEST  SPACE 

].   LINCAR  EOlJATIor.'  U(FLD,  NFLD)  = 

VARIABI-r  X-5     VARIABLE  X-6     VARIABLE  X-9     VARIABLE  X-11    VARIABLE  X-15    VARIABLE  X-17 
-0.33843        -0.25433  15.548         -4.8483         -8.5959  1,1727 

VARIABLE  y.-20         VARIABLE  X-33    VARIABLE  X-41    VARIABLE  X-52    VARIABLE  X-49 
-12.367  21.259  8.2973  32.397  87.518 

CONSTANT  1  =  7.0491 

LOG   OF    P    -=    0.0 


VARIABLE    X-2  0 
13.831 


2.       QUADRATIC    EQUATION    U(FLD,    NFLD)     = 


LINEAR    TI  RMS 

VARIABLE    X-5 

VARIABLE    X-6 

X-5 

3.9405 

0.58244E- 

-01 

X-6 

0.  11802 

0.18144E- 

-01 

0.13666E-01 

X-9 

-149.80 

-1.9399 

-1.1617 

X-11 

22.911 

0.55451 

0.27448 

X-15 

157.20 

-4.9341 

0.33460E-01 

X-17 

2  3.669 

-0.62835 

-0.11987E-01 

X-29 

-142.74 

1.9329 

0.22187 

X-30 

97.497 

2.2121 

0.98080 

X-3  3 

-97.485 

-4.5184 

-1.1800 

X-41 

-74.445 

0.86445 

-0.18767 

X-52 

440.06 

-20.551 

-3.2158 

X-49 

-i:'04.  1 

-6.1724 

-9.3333 

VARIABLE    X-9 


57.880 
1.0504 
78.610 
16.511 
130.80 
-73.125 
122.01 
56.429 
426.25 
1015.3 


VARIABLE  X-11 


-8.5529 
-27.080 
-5.2408 
-58.426 
68.403 
-67.254 
0.88471 
-0. 356G0 
-84.723 


VARIABLE  X-15    VARIABLE  X-17 


-75.071 
-33.977 
-12.598 
■140. 32 

119.79 
■25.030 
-524.67 

674.47 


-3.8914 
1.2036 

-11.493 
18.106 

0.50330 

-6G.906 
151.15 


(SECTION 

2) 

VARIABLE 

X- 

-29 

VARIABLE 

X- 

-30 

VARIABLE 

X- 

-33 

VARIABLE 

X-41 

VARIABLE 

X-29 
X-30 
X-3  3 
X-41 

y-52 

X-49 

-392.76 
180.01 

-269.55 
80.998 
335.64 
891.22 

251.01 
-499.08 

95.406 
-705.60 
-1000.7 

-122.41 

-87.849 

1285.3 

1004.9 

48.598 
177.79 
432.37 

-4755.3 
2921.0 

VARIABLE  X-49 


3510.3 


2  LOG  OF  P  =  0.0 

CO:;STAMT  1  =  -29.529 

LOG  OF  CONSTANT  2  =  -5.4109 

P  =  TI!E  RATIO  OF  APRIORI  PROBABILITIES  OF  GROUP  J  AND  GROUP  I  AND/OR  COSTS  OF  GROUP  MEMBERSHIP 

FOR  QUADRATIC  EQUATIOMS  CONSTANT  2  =  DETERMINANT  (DISPERSION  MATRIX  OF  GROUP  J) /DETERMINANT  (DISPERSION  MATRIX  OF 
GROUP  I) 

quadratic  equations  v{1,3)   are  read  as  a  half  matrix  of  coefficients  defined  by  the  product  of  the  row  and  column 
terms,  plus  linear  tlli'ms  stored  in  the  first  column,  plus  constant  1. 

quadratic  classification  rules  are  as  follows — 

assic!;  to  group  i  if  for  all  other  groups  j,  the  equation  u(i,j)  is  less  than  the  natural  log  of  constant  2  mi^;us 

2  times  the  natural  log  of  p 

li::ear  equations  arf  pcad  as  a  vector  of  coefficients  plus  constant  1. 

lt't.a'-,  classificatjri;;  rules  are  as  follows  — 

afsig":  to  gpol'!'  i  if  i'or  all  other  group  j,  the  equation  u(i,j)  is  greater  than  the  natural  log  of  p 
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TABLE  la       CLASSIFICATION    OF    THE    ORIGTL'AL    SA„MFLE    IN    TEST    II 


Actual 


Predicted  Group 


Group      Total   By  Quadratic  Equation   By  Linear  Equation 


Faiic'^ 


Failed     Nonfailed    Failed    Nonfailc 


30        2  8  2  2  9  1 

(100)    (93.33)      (6.67)     (96.67)     (3.33) 


Nonfai.led 

30 
(100) 

0 
(0) 

30 
(100) 

0 
(0) 

30 
(100) 

Col.  Total 

60 
(100) 

28 
(48.67) 

32 
(53.33) 

(48 

29 

.33) 

31 

(51.67) 

Note:   Figures  in  parentheses  are  perc?;ntages , 
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as  good  as  those  in  Test  I.   This,  of  courso,  was  not 
uiiGxpectod,  as  discussed  in  the  introductory  section.   As 
shown  in  Table  11,  the  quadratic  equation  has  misclassif ied 
43.3  percent  of  the  failed  group  and  23.3  percent  of  the 
nonf ailed  group.   On  average,  the  expected  rate  of  mis- 
classification,  as  calculated  from  this  saiTiple,  is  33.3 
percent.   Although  this  rate  is  not  lov/,  it  is  also  not 
too  high  co  render  the  m.odol  useless. 

To  deterraine  the  relative  reliability  of  the  model 
and  to  see  ho-/  much  its  results  differ  from  those  obtainable 
by  pure  chance,  the  t  test  specified  in  equation  (70)  of 
Chapter  III  was  applied  to  the  misclassif ication  rate. 

If  the  discriminatory  power  of  the  model  was  equal 
to  zero  the  misclassif ication  rate  should  not  have  differed 
from  0.50. 

t  =  (Q  -  P)/ap  =  (0.6667  -  0 . 5000) /O . 0645492  =  2.583 

Tne  tabulated  t's  for  0,01  and  0.005  significant  levels  are, 
respectively,  2.326  and  2.576.   The  comparison  of  the 
com.putad  t  with  the  tabulated  t's  indicates  that  the  results 
obtained  by  the  model  significantly  differ  from  v/hat  is 
expected  by  pure  chance.   This,  in  turn,  means  that  the 
model  has  some  discriminatory  power  and  provides  some  useful 
information. 

Results 

In  an  attempt  to  examine  the  possibility  of  the 
entension  of  the  inferential  and  predictive  use  of  the  MDA 
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TABLE  11.   CLASS  IFIC/a'ION  OF  THE;  HOLLC-UT  SAMPLE  IN  TEST  II 


Actual 
GrouD 


P  r  e  d  i  c  t ,  e  d  G  r  o  u ') 


Total   By  Quadratic  Equation   By  Linear  Equation 


Failed     Nonfailed    Failed    Nonfailed 


Failed        30        17          13         23  7 

(100)  (56.67)  (43.33)  (76.67)  (23.33) 

Nonfailed      30         7          23          11  19 

(100)  (23.33)  (76.67)  (36.67)  (63.33) 


Col.  Total     60        2  4 
(100)      (40) 


36  34        26 

(60)      (56.67)    (43.33) 


Note:   Figures  in  parentheses  are  percentages 
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tccnnique  to  two  years  prior  to  the  ln-)rikru:>tcy  of  S'/curities 
brokcra';je/de3  ier  firms,  a  discriminant  model  was  constructed 
which  uses  12  viiriables.   These  variables  measure  the 
profitability,  risk  exposure,  and  mism.anagoment  of  the 
firms.   Based  on  the  attributes  specified  by  the  variabJ.es, 
the  model  separates  the  failed  from  nonfailed  group  v;ith  a 
sm.all  over].a.p  but  a  significant  disteince  between  their  means. 

Except  in  a  couple  of  cases,  the  model  was  useful  in 
analyzing  the  financial  positions  of  the  firras  and  describing 
the  attributes  of  the  failed  versus  nonfailed  firms.   As  in 
the  previous  test,  the  saiaple  firms  dem^onstrated  inverse 
relationships  betv;oen  failure  and  the  ages  of  a  firm  and 
its  president.   This  perforiaance  is  consistent  with  the 
expectation  that  the  effects  of  the  age  factors  should  have 
continued  from  the  previous  year.   It  was  also  shown  that 
the  failed  firms  in  the  sample  had  lower  ratios  of  retained 
earnings  to  equity  capital  plus  subordinated  debts,  and 
equity  capital  plus  subordinated  debt  to  total  assets.   The 
model  indicated  the  association  of  failure  with  relatively 
high  levels  of  cash  plus  other  deposits,  trading  and  invest- 
ment accoimts,  and  failed  to  deliver  and  receive  outstanding 
30  days  or  more.   The  ratio  of  total  liabilities  minus  sub- 
ordinated debt  to  equity  capital  plus  subordinated  debt  was 
also  greater  in  the  failed  firms  than  in  their  nonfailed 
counterparts.   Finally,  the  failed  firms  in  the  sample  were 
adversely  affected  by  the  percentage  changes  in  GNP . 
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TliG  behaviors  of  the    sample  firms  in  terms  of  the 
proporties  spi-cifi^d  by  tlio  above  variables  wore  consistent 
v.'ith  the  hypothesized  perf  orr.-.ances .   There  were,  hov/evcr, 
sor.ie  ambiquities  in  interpreting  the  coefficients  of  X-30 
and  X-3ji  in  the  discriminant  function.   This  difficulty 
is,  however,  understandable  if  the  statement  in  the 
introd\:ctory  section  is  considered  valid. 

With  respect  to  the  accuracy  ar.d  validity  of  the 
model,  the  96.67  percent  correct  classification  of  the 
original  sample  observations  and  33.33  percent  misclassifi- 
cation  of  the  holdout  sample  observations  prc>vide  some 
criteria  for  evaluating  the  model's  usefulness.   The  t- 
test  conducted  for  examining  the  mi sclassif ication  rate  of 
the  holdout  sample,  however,  revealed  that  the  performance 
of  the  model  significantly  differs  from  v/hat  can  be  ex- 
pected from,  pure  chance.   And,  therefore,  the  model 
possesses  some  useful  discrim.inatory  and  predictive  power. 
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NOTLS 


1.  The  last    report  criteriori  for  such  classification  is 
cluG  to  the  difforonccs  in  tlio  nurr.bcr  of  reports  among 
the  firms.   Some  firms  have  two  reports;  others  nave 
three . 

2.  For  a  list  of  names  of  these  109  firms,  see  SIPC, 
Third  Annual  Report  197  3,  p.  48,  and  SIPC,  Fourth 
Annual  Report  19  74,  pp.  'lS-31. 

3.  The  only  exception  v;as  Hill,  Curtin  and  Ackroyd,  Inc., 
for  which  only  financial  reports  of  1970  and  1972  vvere 
available.   A  m.idreport  v.'as  generated  by  using  the 
proportionality  relations  and  applying  the  procedure 
explained  on  page  219. 

4.  To  be  more  exact,  the  subject  population  is  the  one 
v/hich  has  been  handled  by  SIPC  during  its  four  years 
of  existence.   But  there  m>ight  have  been  some  other 
firms  v/hich  have  voluntarily  liquidated  or  gone  out 
of  business  v/ith  no  custom>ers'  interest  involved. 
Apparently  neither  SIPC,  nor  any  of  the  regulatory  or 
self-regulatory  organizations  is  interested  in  such 
firms  and  thus,  to  the  author's  knowledge,  there  is 
no  comprehensive  listing  available  to  show  the  namics 
of  such  firms. 

5.  To  solve  the  problem,  it  was  decided  that  the  difference 
between  the  beginning  equity  capital  and  the  ending 
equity  capital  of  the  sole  preprietorship  and  partner- 
ship could  be  used  as  a  proxy  for  their  retained 
earnings . 

6.  For  an  excellent  review  of  the  literature  up  to  1966, 
see  A.  A.  Afifi  and  R.  M.  Elashoff,  "Missing  Observa- 
tions in  Multivariate  Statistics  I.   Reviev,'  the 
Literature,"  Journal  of  the  American  Statistical 
Association,  Vo  1 .~"  61  (19  6  6),  ~pp~b3S'-^6T.       FoFlnor e 
recent  publication  on  the  subject,  see  the  references 
in  the  article  referred  to  in  note-  8,  below. 

7.  In  somie  part  of  the  study,  i.e.,  the  part  related  to 
the  Lachenbruch  classification  m.echcd,  a  combination 
of  the  observation  elim.ination  method  and  variable 
elinimation  method  has  also  been  used. 

8.  L.  S.  Chan  and  O.  J.  Dunn,  "The  Treatment  of  Missing 
Values  in  Discriminant  Analysis-I.   The  Sampling 
Experiment,  "  ^oujnifij,,  of  American  Statistical  Association , 
Vol.  6'.'  (1972),  pp.  4'73-477.       '""  " 
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9.   Sgc  the  section  on  "Compviter  Programs"  in  Chapter 
III. 

10.  According  to  Eiscnbcis-Avory ,  the  program  requiires 
about  6  rainutos  of  CPU  time  on  an  IBM  360-50  to 
select  the  optimal  subset  for  each  level  c^f  a  15- 
variable  set,  and  time  approximately  doui)les  with  the 
addition  of  another  variable.   (See  R.  A.  Eisenbeis 
and  R.  B.  Avery,  op.  cit . ,  pp.  77,  f n . ) . 

11.  See  R.  A.  Eisenbeis  and  R.  B.  Avery,  ibid. ,    pp.  76- 
77. 

12.  U.  S.  House  of  Representatives,  Securities  Industry 

Study_, Report  of  the  Subcoirimittce  on  Coirjaerce  and 

Finance   (Subcommittee  Prilit)  ,  Washincfton')  d"^^  cT: 
U.S.  Government  Printing  Office,  August  23,  1972,  pp. 
22-23. 

13.  See  R.  A.  Eisenbeis  and  R.  B.  Avery,  op.  cit.,  pp. 
9,  63,  and  92.  ~~ 

14.  For  information  on  the  differences  between  test  space 
and  reduced  space,  see  ibid. ,  pp.  19-20,  56-63,  and 
98-99. 

15.  For  more  information,  see  ibid. ,  pp.  11,  98,  and  99. 

16.  See  note  3,  above. 

17.  For  the  theoretica]^  discussion  and  biases  of  this 
method,  see  the  section  on  "The  Reliability  Tests," 
in  Chapter  III. 

18.  The  misclassif ied  firm  is  firm  no.  26  of  the  nonfailed 
group  in  the  original  sample.   It  has  also  been 
specified  in  the  related  column  (Group  Pissignment)  in 
Appendix  B. 

19.  See  the  subsection  on  "The  Scram.bled  Sam^.ilc  Method" 
in  Chapter  III. 

20.  The  variables  constructing  the  function  are:   X-10, 
X-21,  X-27,  X-28,  X-33,  X-41,  X-44,  X-63,  X-65,  X-80, 
X-82,  and  X-33.   For  analysis  purposes,  this  set  of 
variables  is,  of  course,  mioaningless ,  because  it  has 
been  derived  from  the  scrambled  observations. 

21.  The  names  of  the  12  variables  arc:   X-6,  X-12,  X-22, 
X-29,  X-33,  X-37,  X-45,  X-67,  X-70,  X-76,  X-78,  and 
X-81. 
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22.  See  note  8,  above. 

23.  The  8  data  items  v.'orc:   failed  to  deliver  within  30 
dcivs;  faiiod  to  deliver  over  30  d-iiys;  customers'  cash 
accounts,  secured  accounts,  pc\rtly  secured  accounts, 
and  unsecured  accounts;  failed  to  receive  within  30 
days;  and  failed  to  receive  over  3  0  days  (sec 
Appendix  E)  .   The  11  vari.ables  were:   X-22,  X-26, 
X-28,  X-29,  X-31,  X-32,  X-33,  X-34 ,  X-39,  X-68,  and 
X-78. 

24.  The  six  variables  are:   X-22,  X-28,  X-29,  X-33,  X-68, 
and  X-7F. 

25.  The  omitted  firms  have  been  marked  by  asterisks  in 
Appendicies  A,  B,  C,  and  D. 

26.  Vari.able  X-49  V7as  used  in  both  first  and  second  runs 
of  this  experim.ent. 

27.  For  the  names  and  characteristics  of  the  selected 
variables,  see  Appendix  I. 

28.  For  '"  discussion  of  the  Lachenbruch  n.ethod,  see 
"The  Reliability  Tests"  in  Chapter  III. 

29.  The  variables  selected  are  X-5,  X~6,  X-12,  X-15,  X-IS, 
X-22,  X-28,  X-41,  X-52,  X-58,  X~68,  and  X-73. 

30.  The  H^  Test  of  the  "normalized"  m.odol  resulted  in  an  F 
statistic  equal  to  6.163  which  is  significant  at  the 
0.0  percent  level.   This  signifies  that  the  appropriate 
procedure  is  the  quadratic  one.   Nevertheless,  for  the 
reasons  mentioned  previously  and  for  preserving  the 
comparability  of  the  models,  the  results  of  both 
linear  and  quadratic  procedures  are  reported  in 
Appendicies  iM  and  N. 

31.  Its  H„  test  resulted  an  F  statistic  equal  to  15.5  which 
is  significant  at  the  0.16860E-9  percent  level. 

32.  The  relative  discriminatory  powers  of  profitability, 
risk,  and  mismanagement  elements  have  been  calculated 
by  summing  up  the  relative  power  of  the  related 
variables  obtained  froiP  Table  4. 

33.  The  fir'T.s  having  only  tv;o  financial  reports  can  be 
identified  by  referring  to  the  related  columns  in 
Appendices  A  and  C. 
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34.  The  29  v.-iriablcs  included  in  Tost  II  a?:e:   X-2,  X-3, 
X-4,  X-5,  A-6,  X--9,  X-10,  X-11,  X-14,  X-15,  X-17, 
X-20,  X-2I,  X-22,  X-23,  X-29,-  X-30,  X-31,  X-33,  X-37, 
X-38,  X-40,  X-41,  X-49,  X-52,  X-53,  X-60,  X-61,  and 
X-83. 

35.  Tho  fir.nts  in  the  original  and  holdout  saraples  in  Test 
II  are  tho  same  firms  as  in  Test  I.   Tliis  treatment 
may  be  useful  in  extending  the  analysis  somev^hat 
further . 


chaptli^  v 
sum:l\ry  and  conclusion 

SuiTLuiarv 


The  important  rule  that  the  securities  brokerage/ 
dealer  firias  play  in  flowing  funds  in  the  nation's  capi- 
tal market  by  channeling  the  savings  of  inrJividuals  to 
corporate  users  and  transferring  the  ownership  of  out- 
standing securities  from  one  investor  to  another  makes 
any  study  in  the  direction  of  helping  the  i.aintenance  of 
fair  and  orderly  markets  for  securities  a  worthwhile 

effort. 

For  a  long  time,  the  going  presum.pt  ion  was  that  the 
efforts  directed  toward  prevcnLing  fraudulent,  deceptive, 
and  manipulative  acts  in  the  securities  markets  would 
protect  the  individual  investors  and  the  oublic  interest. 
Since  1970,  however,  the  drastic  impacts  of  the  financial 
and  operational  difficulties  of  the  broker/dealers  on  the 
securities  m.arkets  have  been  widely  recognized. 

The  Securities  Investor  Protection  Corporation 
(SIPC)  has  been  established  to  provide  li-nted  insurance 
for  the  investors  whose  broker/dealers  go  bankrupt.   Al- 
though a  significant  step,  the  creation  of  SIPC  could  not 
solve  the  problems.   For  one  thing,  SIPC  operates  an  an   . 


24  7 


241 


after-the-fact  basis.   That  is,  when  a  member  firm  falls  into 
financial  trouble,  SIPC  initiates  steps  to  have  it  placed  in 
liquidation.   Secondly,  the  protection  provided  by  SIPC 
against  financial  loss  to  customers  of  the  failed  firm  is 
limited  to  $50,000  per  customer,  no  more  than  $20,000  of 
which  may  be  for  claims  for  cash,   whereas   there  have  been 

several  cases  in  v/hich  claims  for  securities  or  cash  were 

2 

greater  than  these  limits.    And  most  importantly,  SIPC 

can  do  nothing  about  the  misallocation  of  scarce  resources 
embraced  in  each  failure  and  wastage  of  time,  money,  and 
efforts  involved  in  each  liquidation. 

It  is  desirable,  therefore,  to  contrive  a  means  by 
which  the  firm  which  is  approaching  a  financial  or  opera- 
tional difficulty  be  identified  at  the  earliest  time  pos- 
sible, so  that  opportunities  can  be  provided  for  corrective 
actions  and  potential  detriments  can  be  avoided. 

The  present  3tudy  is  an  attempt  in  this  direction. 
More  specifically,  the  purposes  of  this  dissertation  are: 

(1)  to  identify  potential  ecoraomic  and  non-economic  factors 
which  might  differentiate  between  financially-distressed 
firm.s  and  those  which  appear  to  be  economically  viable; 

(2)  to  statistically  determine  a  set  of. variables  which 
has  the  highest  discriminatory  power;  and  (3)  to  suggest 
both  public  and  managerial  action  which  might  be  taken  in 
light  of  the  statistical  results  obtained  in  point  2. 
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To  Giisuro  these  purpcsos,  an    analytic  frariowork  was 
sot  forth  in  ChajJter  II,  'wl.cj  ^.i    it  was  hyjJothosizGd  that  the 
probabili.tv  of  financial  distress  of  a  firm  at  tiir.e  t  is  a 
function  of  the  net  cash-flow  per  dollar  of  assets  at  time  t, 
extent  of  the  risk  expof'ure,  and  the  mismanagement  of  the 
firm.   It  was  also  concluded  that,  because  specific  rela- 
tions between  thiose  elements  and  the  probability  of  financial 
distress  could  not  be  known,  the  proposed  general  relation 
should  be  expanded  to  include  all  possible  factors  affecting 
the  above-micntioned  elem.ents. 

In  that  chapter,  three  potential  approaches  for 
predicting  the  financial  position  of  a  firm  in  the  near 
future  v/ere  discussed.   They  were:   "deterministic," 
probabilistic,  and  blend  approaches.   The  "deteriainistic" 
approach  was  meant  to  illustrate  the  traditicr.al  method  of 
prediction  using  the  historical  financial  ratios  in 
conjunction  with  the  MDA  technique.   The  probabilistic 
approach  v/as  stated  to  be  the  one  v^hich  does  not  treat  the 
financial  ratios  and  other  related  forces  affecting  the 
firm's  financial  positions  as  factors  whose  values  can  be 
determined  with  certainty  at  any  point  of  tim.'^  m  the  near 
future.   Rather  it  attaches  som,e  probability  distribution 
to  each  factor,  based  upon  its  historical  performance.   And,. 
instead  of  using  the  MDA  technique,  it  relies  upon  the  modern 
probability  theories. 

The  probabilistic  approach  suggested  in  this  study, 
although  intriguing,  requires  a  great  deal  of  research 
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before  it  can  be-    actually  lout  to  work.   The  blond  approach, 
used  in  the  prosont  study,  v/as  tlici-ofore  propofsed  to  act  as 
a  bridge  betw:>en  the  traditional  and  the  probabilistic 
approaches.   This  approach,  vdiile  retair.ing  all  the  favor- 
able properties  of  the  "deterministic  approach,  especially 
with  respect  to  th,e  utid.ization  of  the  MDA  technique,  in- 
corporates soine  facets  of  the  probabi.l  i  ty  approach.   More 
specitically ,  it  introduces  and  utilizes  variables  which 
measure  the  volatility  and  variations  of  most  of  the  sig- 
nificant internal  and  external  factors  having  impacts  on  the 
firm's  performance. 

Thus,  in  this  way,  most  of  the  potential  economic 
and  non-economic  factors,  v/hich  might  differentiate  betv/oen 
failed  and  nonf ailed  finvis,  v/ere  identified.   But,  before 
these  factors  were  exposed  to  the  iMDA  techniqv.e,  this  tech- 
nique was  previewed  in  Chapter  III  to  provide  the  required 
groundwork  for  the  empirical   testing   of  the  hypothesis. 

Chapter  III  presented  an  overall  picture  of  the  MDA 
technique  in  application  in  a  summary  form.   The  related 
mathematical  formulations  were  m.entioned  only  up  to  a  point 
whore  they  have  direct  bearing  on  the  understanding  of  the 
application.   A  section  of  the  historical  development, 
however,  provided  sufficient  references  for  further  infor- 
mation.  A  discussion  of  the  general  assumptions  gives  an 
idea  about  the  con.ditions  under  which  the  results  are  more 
reliable.   Relatior'  to  regression  analysis,  r--lative 


251 


ir.nortr.nce  of  variabiles,  reliability  tosts,  and  the  ava,ilable 
MDA  computer  prcgran.s  wore  the  other  topics  discussed  in 
Chapter  III.   Tlie  chapter  concludes  after  a  brief  survey  of 
th.e  application  of  MDA  in  the  financial  area. 

The  next  step  iv'as  t.o  empiri  c£-illy  test  the  suaqestod 
hypothesis.   But,  to  fully  test  the  hypothesis,  there  was  a  ' 
need  to  have  data  on  the  most  significant  factors  included 
in  the  expanded  form  of  the  hypothesized  function.   Uti- 
fortunately,  hov;ever,  the  efforts  to  obtain  sucli  data 
proved   to  be  futile.   The  Public  Reference  Office  of  the 
SEC,  although  cooperative  in  supplying  the  financial  Forms 
X-17A-5  of  the  selected  firm.s,  denied  the  accessibility  of 
Forms  X-17A-10,  on  the  grounds  that  they  v;ere  confidential. 

As  a  consequence  of  the  above  problem,  reconsidera- 
tion of  all  pertinent  factors,  substitution  of  some 
variables,  and  modification  of  the  hypothesized  function 
becam.e  an  absolute  necessity.   In  particular,  the  cash-flow 
element  in  the  function  was  substituted  v;ith  a  more  general 
term,  the  prof itablity . 

To  conduct  the  empirical  test,  60  STPC  failed  firms 
and  60  nonfailed  firms  were  selected  and  each  of  them  was 
randomly  divided  into  two  equal-size  groups.   One  group 
was  used  in  an  original  sample,  and  the  other  in  a  holdout 
sample.   The  original  Scample  was  used  to  derive  the 
discriminant  function  and  the  holdout  sample  to  test  the 
validity  and  predictability  of  the  derived  function.   These 
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samolos  were  used  in  sovor^l  exncr imc-nts ,  tv.o  of  wliich 
constituted  TuPt  I  and  Tost  IJ,    and  t ho  rest  wore  utilized 
for  evaluiit.mq  the  potent  i.al.  biases  m  the  original  dis- 
criminati nn  model,  Test  I. 

Test  I  is  the  mai.n  experiment  which  has  been 
conducted  to  analyze  the  financial  position  of  the  firms. 
This  test  is  based  upon  the  last  financjal  reports  of  the 
failed  firms  and  the  pertinent:  reports  of  the  nonfailed 
firms.   Since  the  mean  value  of  the  last  report  lag  tirn.e 
is  about  11.5  months,  the  model's  predictive  span  m.ay  be 
construed  as  being  about  one  year. 

In  this  test,  a  total  of  8  3  variables  v/as  examined 
and  12  of  them  were  selected  as  an  "optima] "  subset  by  a 
multi-stage  stepv/ise  selection  procedure.   These  variables 
measure  the  profitability,  risk  exposure,  and  mismanagement 
of  a  firm..   The  model  which  incorporates  these  12  variables 
separates  the  failed  and  nonfailed  groups  in  such  a  way 
that  there  is  virtually  no  overlap  between  them  and  the 
distance  betw-een  the  group  m.eans  is  high.ly  signficant. 

The  model  was  very  h^elpful  in  describing  the  attri- 
butes of  the  failed  and  nonfailed  firms.  The  sample  firms 
showed  the  association  of  lower  ages- of  a  firm  and  of  its 
president  with  failure.  They  also  demonstrated  that  non- 
failed  firms,  on  average,  have  relatively  greater  retained 
earnings  to  total  assets.  Tlie  interest  rates  in  tlie  prior 
year  have  also  had  {-o^itive  effects  on  the  nonfailed  firj;;;-. . 


The  relations  bctv/oen  failure  and  porcontage  clianqos  in 
equity  c;:!i:'ital,  r.-rcent  of  the  short  positions  in  the  trad- 
ing and  invostir.cnt  accounts,  and  the  coefficient  of  varia- 
tion of  the  ratio  of  the  trading  and  investinent  accounts  to 
total  assets  v.'ore  shov.'n  to  be  positiA^e.   This  direct  rela- 
tion also  existed  for  variables  dealj.rig  v;ith  partly  secured 
and  unsecured  custoniors'  securities  accounts,  total  li- 
abilities minus  subordinated  debt,  and  property  and  fixture 
manageinant . 

'Ihe  accuracy  and  validity  of  the  model  were  examined 
by  several  methods.   The  quadratic  classification  procedure 
misclassi.f ied  only  1.67  percent  of  the  observations  in  the 
original  sample.   This  very  low  percentage  of  misclass.i.f i- 
cation  indicates  th£it  the  model  is  significantly  accurate. 
The  expected  rate  of  raisclassif ication  in  the  holdout 
sample  was  13.3  percent.   This  rate  also  reveals  that  the 
discriminatory  pov;er  of  the  miodel  is  satisfactory.   The 
scrambled  method,  the  Lachenbruch  classification  procedure, 
and  the  "normalized"  model  did  not  indicate  that  the 
potential  biases  in  the  model  are  significant. 

In  an  eittempt  to  examine  the  possibility  of  the 
extension  of  the  inferential  and  predictive  use  of  the  MDA 
technique  to  two  years  prior  to  the  bankruptcy  of  securities 
brokerage/dealer  firms.  Test  II  was  conducted.   This  test 
also  utilized  the  quadratic  discriminant  model  with  12 
variables.   Based  upon  the  inform.at  ion  contained  in  the 
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financij]  rcoorLs  i.ssuod  two  years  pr.icr  to  bankruptcy,  the 
nioclo]  constructed  in  this  test  v;iis  able  to  reclassify  96.7 
percent  of  the  observations  in  the  original  sample  correctly. 
The  expected  rate  of  misclassif  ica  t  i  on  in  t-ie  holdout  sample 
was  not,  h.owevor,  as  good  as  that  of  Test  T.   This  rate  was 
33.3  percent.   Although  this  rate  is  high,  it  is  sj gni f icantly 
better  than  pui:e  clianco.   The  result,  however,  v/as  not  unex- 
pected because  the  specific  nature  of  the  securities  industry 
and  the  kinds  of  the  available  data  make  any  remote  prediction 
difficult,  if  not  impossible. 

Conclusion 

The  relatively  lov;  rates  of  m.isclassif ication  both  in 
the  original  sample   and  holdout  sam.ple  of  Test  I  along  with 
the  results  of  H   test  and  group  overlap  examination  indicate 
that: 

-  The  failed  securities  brokerage/dealer  firms 
possess  somie  attributes  which  are  different 
from  those  of  the  nonf ailed  firms, 

-  Those  attributes  can  be  identified  by  the  blend 
approach  v;hich  incorporates  some  probabilistic 
variables,  even  with  such  a  scant  amount  of 
financial  data  as  those  available  to  the  present 
study.   And, 

-  The  attributes  can  be  utilized  in  predicting  the 
failure  for  a  period  as  long  as  ab^jut  one?  year 
prior  to  bankruptcy. 
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Furthcrp.oro ,  tlio  relative  di  scr  i:ni  natory  power  of  the 
selected  variables  sliow  that  the  risk  eicnent  is  the  most 
significant  determinant  of  the  firin's  survival.   The  i;ripor- 
ance  of  prof  i  tabi.li.ty  is  next  and  that  of  the  misraanagement 
is  last. 

The  results  of  Test  II  were  not  as  qood    as  Test  I. 
This  was  not  unexpected,  however,  and  may  signify  that  the 
more  remote  span  of  the  prediction  weakens  the  predictive 
power  of  the  model,  and  for  obtaining  better  results  there 
is  an  absolute  need  for  a  more  complete  set  of  data  than 
what  was  available  to  the  present  study. 

Imp] ications 

The  results  obtained  from  this  work  indicate  that 
the  blend  approach  applied  to  the  empirical  experiiaent  is  a 
useful  method  for  dealing  with  predictive  issues  of  this 
nature.   A  mode],  of  the  type  developed  in  this  study  can  be 
readily  used  by  all  the  parties  having  interests  in  a 
securities  brokeragc/'dealer  firm. 

Since  the  enactment  of  the  Securities  Act  of  1933, 
the  securities  industry  has  been  i:;creasing  ly  put  under  the 
scrutiny  and  control  of  numerous  governmental  regulatory 
and  self-regulatcry  organizations.   The  numerous  and  ever- 
increasing  rules  and  regulations;  the  various  forms,  reports 
and  paper'.vork  requirements;  and  the  overlapping  and  occa- 
sionally conflicting  functions  of  these  agencies  have  creat.: 
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a  situation  which  is  fully  reflected  in  the  related  studies 

reviev.'ed  in  Chaj:)toi;  I  and  is  partially  indicated  by  the 

v;ords  of  the  Subconxniittee  on  Securities  when  it  says: 

...Regulation,  by  government  and  by  industry 
groups,  is  an  essential  element  in  protec- 
tion of  investors,  but  is  not  an  effective 
substitute  for  competit-.ion  in  assyring  a 
flexible  and  healthy  industry....-^ 

Under  such  a  circumstances,  if  and/or  v/hen  a   model 
such  as  the  one  employed  in  this  study  can  be  developed 
in  the  best  way,  some  of  these  problems  may  be  solved.   By 
using  such  a  model,  the  management  of  a  firm  may  m.onitorize 
itself  before  the  situation  becomes  irreversible.   The 
creditor  may  effectively  evaluate  the  firm's  positions  be- 
fore feeling  sorry.   The  investor  m,ay  take  a  right  3':cp 
before  losing  his  capital.   And  the  free-market  principle 
may  become  operative  before  it  collapses. 

Al]  these  can  be  accomplished  only  if  the  informa- 
tion upon  which  the  model  is  constructed  is  accurate, 
suitable,  reliable,  and  available  to  the  public.   As 
eir>ph<asized  by  the  Subcormnitteo  on  Securities  in  its 
Securities  Industry  Study,  a  hign  order  of  importance  should 
be  accorded  to  tlie  public  disclosure  of  the  financial  con- 
dition of  brokers  and  dealers.   In  the  Subcomjnittee '  s  words: 

...The  i.mporrance  of  public  disclosure  of 
information  regarding  the  financial  condition 
of  brokers  and  dealers  should  not  be  under- 
estimated .  .  .  . 

In  view  of  the  above  comnicnts,  it  seems  that  the 

public's  interest  is  better  served  if  efforts  are  focused 
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on  croatiiiC}  conditions  whic;h  lead  to  cho  provision  of 
accurate-,  suitable,  roliable,  and  acce^.'siblo  financial  data 
of  the  securities  brokerage/dealer  firms. 

S u gge s t  i o ns __ f  or  Further  Research 

Based  upon  the  results  of  this  study,  two  areas  seeui 
to  deserve  further  research.   The  first  is  short-run  and  is 
more  practical.   The  second  one  is  raore  suitable  for  a  long- 
range  endeavoi:  and  is  theoretical  in  nature. 

First  A_re_a 

This  area  may  be  considered  as  application  of  the 
analytic  framework  set  forth  in  Chapter  II  to  a  more  complete 
set  of  data  than  what  v/as  available  to  the  present  study. 

Second  Ai-ea__ 

This  area  may  be  considered  as  the  consecutive  steps 
leading  to  the  development  of  the  probabilistic  approach 
to  prediction,  as  discussed  in  Chapter  I.   More  specifi- 
cally, the  research  needed  comprises: 

(1)  Identifying  all  the  probabilistic  factors 
affecting  the  probability  of  the  firm's  financial  distress. 

(2)  Determining  the  inter-relations  between  these 
factors , 

(3)  Finding  a  suitable  way  to  estimate  the  a  priori 
probabilities  of  such  factors,  and  finally 

(4)  Formulating  the  probabilistic  m.odel. 
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MOTES 


1.  Recently,  it  has  been  proposed  that  the  above-mentioned 
limit  be  i-ai?ed  to  $100,000,  of  v/hich  no  ruore  than 
$40,000  may  be  for  claims  for  cash.   But,  even  when  the 
proposal  is  enac:ted,  it  is  expected  that  tlie  uncovered 
cases  would  still  remain  in  a  considerable  number.   This 
would  be  due  to  both  inflation  and  the  higher  level  of 
the  accumulated  wealths. 

2.  See  STPC  Fourth  Annual  Report  1974,  p.  2 

3.  See  Note  7  of  Chapter  I. 

4.  U.S.  Senate,  Securities  Industry  Study,  Report  of^the 

S ubc ommittee  o n_s'e c ur  j-ties  ,  Committee  on  Banking,  Hous- 
ing, and  Urban  Affairs   (Conar.ittee  Print),  Washington, 
rT.  C  .  ,  uTs".~'Govei'nrain~P  r  in  ting  Office,  February,  1973, 
p.  39. 
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APPKNDIX  E 
A  LIST  or  DATA  SET  ITEr-IS 

President's  aoo. 

President's  od^^cational  background  (coded  0  for  under  high 
school,  1  for  high  school,  2  for  junior  college,  3  for 
a  bachelor's  degree,  4  for  a  master's  degree,  and  5  for 
a  doctoral  degree) . 

President's  experience  in  general  business  (other  than 
securities  business)  during  past  ten  years. 

President's  experience  in  securities  business  during  past 
ten  years. 

Firm's  ago  in  years. 

Last  Report  Lag  Ti.me  (LRLT)  in  m.oriths. 

Date  of  the  last  report. 

Number  of  reports. 

Date  of  the  subject  report. 

Total  assets. 

Cash  on  hand  and  in  banks  plus  certificate  of  deposits. 

Total  accounts  receivable. 

Accounts  receivable  from  broKcr/dealers  (total) . 

Failed  to  deliver  (total). 

Failed  to  deliver  within  30  days 
Failed  to  deiiver  over  30  days 

Other  broker/dealers  accounts  v/ith  debit  balances. 

Accounts  receivable  from  customers  (see  Question  6  of 
the  Financial  Questionnaire,  Form  X-17A-5). 

Cash  accounts  (see  Question  6A.1  of  the  form 

above) . 
Secured  accounts  (see  Question  6B.i  of  the  form 

above) . 
Partly  secured  accounts  (see  Question  6C.1  of  the 

for  above) . 
Unsecured  accounts  (see  Question  60  above) . 
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Account.3  receivable  irom  directors  av.d    officers 

Accour^ts  with  debit  balajicos  on  trading  and  investment.. 

Accounts  v/ith  credit  balances  on  tradi.ng  and  investment. 

Property,  furniture,  fixtures,  and  leasehold. 

Deposits  other  than  the  certificate  type. 

Excliange  membership  value. 

Money  borrowed,  secured  by  or  containing  customer's 
collateral . 

Money  borrowed,  secured  by  the  firni's  collateral. 

Total  accounts  payable. 

Accounts  payable  to  broker/dealers. 

failed  to  receive. 

Failed  to  receive  within  3  0  days. 
Failed  to  receive  over  30  days. 

Other  broker/dealers  accounts  v.'ith  credit 
balances . 

Accounts  payable  to  custom.ers. 

Accounts  payable  to  directors  and  officers. 

Type  of  the  organization  (coded  0  for  corporation,  1  for 
partnership,  and  2  for  sole  proprietorship) . 

Total  capital  account. 

Stocks 

Treasury  stocks 
Paid-in  surplus 
Retained  earnings. 

Subordinated  debt. 

Commission  accounts  with  debit  balances. 

Commission  accounts  with  credit  balances. 

Time  period  between  two  consecutive  reports. 

Population  of  the  city  where  the  firm's  main  business  is 
located . 


2  69 


Per  capita  income  of  the  city  mentioned  above. 

GNP  in  the  reporting  year  period. 

Dow-Jo]',cs  Total  Index  (annual  averages  of  monthly  figures)  . 

Standard  snd  Poor's  Common  Index  (annual  averages  of  monthly 
figures) . 

Interest  rates  (bank  rates  on  short-term  business  loans) 

Total  market  value  of  bond  and  stock  sales  in  the  reporting 
year . 


appl;ndix  f 

LIST  OF  Tlii:  VARIABLE^ 


X-1   -  Co;?>pany's  code  nainber. 

X-2   --  Prosidont's  educational  backcjround  . 

-'^-3   =  Fi'esidc-ri!: '  s  experience  in  general  busines  (years). 

X-4   -     President's  experience  in  securities  business 

(years) . 
X-5   -"  F'irm's  age. 
X-6   "  President's  age. 
X-7   -  Cash/TA* 

X-8   -   (Cash  -V    Oidier  Deposits  ^  Membership) /T/\* 
X-9   :-   (Cash  +  Otlier  Dopes i  ts) /TA. 
X-10  =  Reta-iried  Earnings/Equity  Capital. 
X-11  ^--     Retairied  P:arnings/ (Equity  Capital  +  Subordinated 

Debt) . 
X-1 2  "  Retained  Earnings/TA. 
X-13  -  Equity  Capital/TA* 

X-14  =■■  Total  Liabilities/Equity  Capital. 
X-15  =   (Equity  Capital  -i-  Subordinated  Debt)/TA. 
X-16  =   (Total  Liabilities  -  Subordinated  Debt) /Equity 

Capital* 
X-17  ^  Total  Liabilities  -  Subordinated  Debt )/ (Equity 

Capital  +  Subordinated  Debt) . 
X--18  =   (Total  Liabilities  -  Subordinated  Dobt)/TA. 
X-19  =  Total  Liabilities/TA. 
X-20  =  Subordinated  Debt/ (Equity  Capital  +  Subordinated 

Debt) . 
X-21  =  AC  Receivable/TA. 
X-22  ^   (Partly  Secured  AC  +  Unsecured  AC)/TA. 

Partly  Secured  AC  are  customers'  securities  accounts 

liquidating  to  a  deficit  (Question  6,C   of  the 

Financial  Questionnaire) .   Unsecured  AC  are 

customers'  securities  accounts  which  are  reported 

under  Question  6,D  of  the  Questionnaire. 

AC  Rcceiveable  from  ED/TA. 

AC  v;ith  Debit  Balances  on  Trading  and  Investment/TA* 

Failed  to  Deliver/TA* 

Failed  to  Deliver  v/ithin  30  Days/TA*" 

(F'ailed  to  Deliver  +  Failed  to  Receive) /TA. 

Failed  to  Deliver  Outstanding  2'^    days  or  miore/TA. 

(Failed  to  Deliver  30  Days  or  More  ^-  Failed  to 

Receive  3  0  Days  or  More)/TA. 

AC  Receivable  fi;om  Cus tom.ers/TA. 

Failed  to  Receive  within  30  Days/TA* 

Customers'  Securities  Cash  AC  (Question  6,A)/TA* 

Customers'  Securities  Secured  AC  (Question  6,B)/TA. 

Customers'  Securities  Unsecured  AC  (Question  6,D)/TA* 

(Borrowings  +  AC  Payable) /TA* 
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X- 

-23 

X- 

-24 

X- 

-25 

X- 

-2  6 

X- 

-27 

X- 

-28 

X- 

-29 

X- 

-30 

X- 

-31 

X- 

-32 

X- 

■33 

X- 

-34 

X- 

-35 

21 


X-36      ^       (Borroo'intjs    +   AC    Payabl.o    +    Subordinated    Debt)/ 


X-3  7 
X-38 
X-3  9 
X-4  0 
X-41 


x-4: 


X-4  4 
X~4  5 


y , 

46 

x- 

-47 

x- 

-43 

X- 
X- 

-49 
-50 

X-51   = 


X-52 


X-53  = 

X-54  = 

X-55  = 

X-5  5  - 

X-57  = 

X-58  = 


Borro.-;i  ncjs/TA. 

AC  Payable  to  DD/TA. 

Fai].;.'d  to  R..?coivo  Outs  Landing  3  0  days  or  More/T7v* 

AC  PayabjG^  tcj  Custoirier/'T/i . 

(AC  v;ith  Debit  Balances  on  Tj-adijig  and  Investnont 
+  AC  with  Credit  Balances  on  Trading  and  Tnvost- 
raont)  /TA. 

■.(Total  Liabilities  -  Subordinated  Debt) /ATA 

(changes  from  the  year  before  last). 

:\  (Total  Liabilities  -  Subordinated  Debt) /^Equity 

Capital  (changes  fror.i  the  year  before  last.) 

A  Cash/AT7^  (changes  from  the  year  before  last)  . 


Abquity  Capital/ (Equity  Capital) 


t-i: 


The  numerator  is  the  change  in  equity  capital 

from  the  year  before  last  to  the  last  year,  and 

i-.]ie  denomiriator  is  the  amount  of  the  equi.ty 

capital  in  the  year  before  last. 

AEquity  Capital/ATA  (changes  from  the  year  before 

last)* 

ARetained  Earnings/ (TA)    .   (Similar  explana- 
tion as  for  X-45.) 

ARetained  Earnings/ATA  (changes  from  the  year 

before  last) . 

AGNP/(GNP) ^_j  (similar  explanation  as  for  X-45). 

(Rl-di) /c-j^ ,  v;here  R^  is  the  firm's  age,  d-,  is  the 
disaster  level  for  the  age  factor  which  is  the 
mean  value  of  the  ages  of  the  failed  firms  in  the 
original  sample,  and  o  .^  is  the  standard  deviation 
of  the  values  whose  mean  is  d,.--* 

(^2~^2 ) /'-''' /  where  R2  is  the  age  of  the  firm. 's 
president ,  d2  is  the  disaster  level  for  the  age 
factor  which  is  the  mean  value  of  the  ages  of 
the  presidents  of  the  failed  firms  in  the  original 
sample,  an.d  02  is  the  standard  deviation  of  the 
values  whose  mean  is  d-,.TT* 
AC  with  Credit  Balances  on  Trading  and  Investniont/ 

(AC  with  Debit  Balances  on  Trading  and  Investment 
-t-  AC  with  Credit  Balances  or^  Trading  and  Invest- 
ment) . 

Property  and  Fixtures  (Question  13  of  the 
Questionnaire) /TA. 
GNP  in  the  reporting  year. 

Dow- Jones  Total  Index  (annual  averages  of  monthly 
figures) . * 

Standard  and  Poor's  Coiranon  Index  (annual  averages 
of  monthly  figures).* 

(Interest  Rate)  ,  whore  t  stands  for  the  firm's 
last-report  year. 

(Interest  Rate)    ,  where  t  stands  for  the  firm's 
last-report  year. 
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X-59 

X-6  0 
X-61 
X--G2 
X-6  3 
X-b4 
X-65 
X-6  6 
X-67 
X--68 
X--69 
X-70 
X-71 
X-7  2 
X-73 


X-74 

X-75 

X-76 

X-77 

X~78 

X-79 

X-8  0 

X-81 

X-8  2 

X-83 
X-84 


Total  Nlarkot  Valuo  of 
ropoi-ting  year  and  in 


Bond  and  Stock  Sales  (in 
$  million) * 


the 


>ita 


Loq^O  o 
Per  Ca; 
Failed  to  R. 
Coefficient 
Coef f  icient 
Coot f ] c lent 
Copff  icient 
Coef f ioient 
Coef  f icient 
Coef  f  icier,  t 
Coefficient 
Coef f ici  cnt 
Coefficient 


iron's  City  Po 
Tncorao  on  Firm' 
■CO  ive/TA* 
of  Variation  of 
of  Var  ration  of 
of  Variation  of 
of  Variation  of 
of  Variation 
of  Var.iation 
of  Va]-aation 
of  Variation 
of  Variation 
of  Variation 


of 
of 
of 
of 
of 
of 


(R3-d3)/a3,  where  R3 


and 


vjlation . 
5  City. 

X-7. 

X-14  . 

X-15. 

X-12. 

X-2  5. 

X-22  . 

X-41. 

X-52. 

X-53  . 

X-62. 

are,  respectively,  mean 


(R4"d, 


(Re 


-cl.)/a. 


(R,^-d.^)/a.^ 


as  in 


as  in 


and  standard  deviation  of  X-7  and  d3  is  the  disas- 
ter level  for  X-50  which  is  calculated  by  averag 
ing  the  values  of  X-50  in  the  failed  firms  group 
of  the  original  sample. ft 

/a^  (similar  explanation 
for  X-14)* 

(similar  explanation  as 
for  X-15)  . 

(similar  explanation 
for  X-12) . 

(sim.ilar  explanation  as 
for  X-25) . 

(similar  explanation  as 
for  X-2  2) . 

(similar  explanation  as 
for  X-41) . 

(similar  explanation 
for  X-52) . 

(similar  explanation  as  in 
for  X-53) . 

(similar  explanation  as  in 
for  X-62) . 

Total  Assets  (a  proxy  for  the  firm's  size). 
A  (Standard  and  Poor's  CoiiLmon  Index)  /  (Standard  and 


(Rg-d 


)/c 


8 


(Rg-dg)/ag 

(^10-^10^/' 


x-7  3, 
X-7  3, 
x-73, 
in  X-7  3, 
in  X-7  3, 
X-73, 


m 


but 
but 
but 
but 
but 
but 
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Poor's  Comm-on  Index) 


,  where  the  nominator  is 
change  from  year  t-l"  "  to  year  t,  to  show  the 
result  as  the  percentage  change* 


'For  the  data  items  used  in  constructing  these  variables, 
see  Appendix  E.   Meanwhile,  the  following  abbreviations 
and  symbols  have  been  used  in  the  variable  list,  for 
s  imp 1 i  f  ic  at  i  on : 


AC  for  /-.ccounts. 

BD  iur  BrokGr/c-ca]  ors  . 

T-'v  f  o  r  To  t  a  ].    A  s  s  g  1 3  . 

A  for  cluinges    iu. 

/  for  divisior. 


27  3 


i'or    ana  lytic  ill    dif^cus; 
section   on    "varia"i:)le    ; 


on    on    this    v^ariable, 
■t"    m    Test    I. 


see  the  sub- 


*ThesG  variables  vvere  later  omitted  frora  the  analysis  due 
to  their  high  correlations  vith  other  variables  (see 
Appendix  G) . 
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APPENDIX  H,  PART  I 
VARIABLE  STRUCTURE  OF  GROUP  FAILED 


Standard 

standard  Error 

Variable 

Mean 

Deviation 

of  Mean 

Maximum 

Minimum 

F-Statistic 

X-2 

2.500000 

0.9566892 

0.1235030 

5.000000 

0.0 

2.668944 

X-3 

2.645826 

2.740095 

0.3537447 

9.000000 

0.0 

4.950998 

X-4 

6.R36727 

2.  776093 

0.3583919 

10.00000 

1.000000 

20.69029 

X-5 

6.383333 

5.831218 

0.7528069 

34.00000 

2.000000 

27.  33253 

X-6 

39.18367 

8.280439 

1.068999 

60.00000 

25.00000 

52.67641 

X-9 

0.1809633 

0.1837232 

0.2371855E-01 

0.7867000 

-0.2960000E-01 

3.  122393 

X-10 

2.  303801 

10.29112 

1.328578 

55.96989 

-12.88090 

1.949  568 

X-11 

0.9263214 

13.91727 

1.796711 

102.8907 

-19.85579 

0.1136909 

X-12 

-0.4725481 

1.221793 

0.1577328 

0.4080999 

-7.498400 

15.64845 

X-14 

0.5817980 

17.48567 

2.257390 

45.51208 

-55.63170 

0.6154320 

X-15 

0.2851364 

0.2830713 

0.3654435E-01 

0.7873000 

-1.081800 

25.28076 

X-17 

0. 3761680 

16.62885 

2.146774 

21.07790 

-119.8987 

0.7022991 

X-18 

0.7148631 

0.2830716 

0.3554439E-01 

2.081800 

0.2126999 

25.28110 

X-19 

0.8936597 

0.4724157 

0.6098860E-01 

3.834200 

0.2126999 

27.09923 

X-20 

' 0.8385843E- 

-01 

2.918872 

0.3768247 

3.182300 

-21.46999 

0.1029068E-01 

X-21 

0.4080749 

0.2546532 

0.3287558E-01 

0.9367000 

0.0 

4.69991 

X-22 

0.2224332E- 

-01 

0.3780302E- 

-01 

0.4880346E-02 

0.2093000 

0.0 

18.05386 

X-2  3 

0.2278966 

0.2149901 

0.2775510E-01 

0.9367000 

0.0 

4.639874 

X-27 

0.4701065 

0. 3892435 

0.5025112E-01 

1.570299 

0.0 

16.52359 

X-28 

0.2053833E- 

-01 

0.2829296E- 

-01 

0.3652605E-02 

0.1372000 

0.0 

22.03366 

X-29 

0. 56688 JIE- 

-01 

0.6480438E- 

-01 

0.8366209E-02 

0.2818000 

0.0 

33.48315 

X-30 

0.165066 

0.1706983 

0.2203706E-01 

0.8297000 

0.0 

0.6316518 

X-33 

0.3402665E- 

-01 

0.6410486E- 

-01 

0.8275900E-02 

0.3349000 

0.0 

0.7226651 

X-37 

0.1  ',8668  3 

0.2112305 

0.2726973E-01 

1.070200 

0.0 

0.5831894 

X-3^ 

0.2020766 

0.2249205 

0.2903711E-01 

0.7682000 

0.0 

9.772126 

X-40 

0.1449149 

0.1301529 

0.1680266E-01 

0.5045000 

0.0 

3.589977 

X-41 

0.27^i7480 

0.2587057 

0.3339875E-01 

1.027399 

0.0 

0.7568508E-01 

V_  ',   '• 

1.  i.'l'OH 

4.152411 

0.5360740 

32.18639 

-3.222199 

0.6493771 

'■:-', '', 

2.  •  :9875 

44.66104 

5.765715 

187.1410 

-267.4263 

0.3M:9647 

Standard 

Standard  Error 

Variable 

Kean 

Deviation 

of  Mean 

Maximum 

Minimum 

F-Statistic 

X-44 

0.2531917 

3.542739 

0.4573656 

13.68660 

-22.18379 

0.1666017 

X-45 

0.2476949 

0.8268071 

0.1067403 

3.699800 

-0.9665999 

2.770066 

X-47 

-0.2355668E- 

-01 

0.6619875 

0.8546221E- 

-01 

3.258400 

-1.468100 

1.066381 

X-48 

1.383699 

9.673825 

1.248885 

72.22879 

-6.102900 

1.011167 

X-49 

0.3844500E- 

-01 

0.2378771E- 

-01 

0.3070980E- 

-02 

0.6130000E-01 

-0.4299998E- 

-02 

0.5817761 

X-52 

0.4409998E- 

-01 

0.1290876 

0.1666514E- 

-01 

0.6296000 

0.0 

5.057554 

X-53 

0.e894493E- 

-01 

0.1374604 

0.1774606E- 

-01 

0.7622000 

0.0 

1.016790 

X-54 

765. 1680 

36.74188 

4.743356 

839.2000 

722.5000 

0.2457964E- 

-01 

X-57 

6.860497 

1. 13TlHn 

0.1462928 

8.480000 

5.820000 

0.2297439E- 

-01 

X-58 

7.262996 

1.077483 

0.1391025 

8.480000 

5.820000 

1.974225 

X-6  0 

5.951521 

0.9619000 

0.1241807 

6.897400 

3.000000 

0.2924239 

X-61 

3502.067 

452.7112 

58.44475 

5343.000 

2529.000 

0.2805504 

X-6  3 

0.5972832E- 

-01 

0.7490754E- 

-01 

0.9670518E- 

-02 

0.3848000 

0,9999999E- 

-04 

11.11113 

X-6  4 

2  3.6  326  3 

60.94411 

7.867849 

369.4722 

0.2360000E- 

-01 

8.677245 

X-65 

0.1594649 

0.4343243 

0.5607102E- 

-01 

2.653299 

0.9999999E- 

-04 

6.099426 

X-66 

0.2346463 

0.5930182 

0.7655a29E- 

-01 

3.403999 

0.7000000E- 

-03 

0.3702107E- 

-01 

X-6  7 

0.6753993E- 

-01 

0.1022211 

0.1319668E- 

-01 

0.6184000 

0.0 

13.19552 

X-6  8 

0.1421666E- 

-01 

0.2445292E- 

-01 

0.3156858E- 

-02 

0.1138000 

0.0 

10.44350 

X-69 

0.7233328E- 

-01 

0.8629304E- 

-01 

0.1114038E- 

-01 

0.3695000 

0.0 

6.700940 

X-70 

0.2631165E- 

-01 

0.7634193E- 

-01 

0.9855699E- 

-02 

0.4197000 

0.0 

5.048196 

X-71 

0.1973666E- 

-01 

0.4278952E- 

-01 

0.5524099E- 

-02 

0.2412000 

0.0 

0.3880569 

X-72 

0.6^60660E- 

-01 

0.8276087E- 

-01 

0.1068438E- 

-01 

0.4055000 

0.0 

3.293394 

X-73 

2.084451 

5.855290 

0.7559147 

35.86809 

-15.79490 

0.2256026E- 

-01 

X-7  5 

4.062044 

7.271001 

0.9386821 

54.20029 

-0.4127000 

5.031442 

X-76 

2.480680 

5.842123 

0.7542148 

34.76849 

-1.754700 

25.69186 

X-77 

1.8.15468 

6.024769 

0.7777942 

40.76649 

-8.366300 

4.021666 

X-7S 

-1.91 3857 

19.90100 

2.569207 

64.39659 

-137.2993 

5.740931 

X-79 

0.9710279 

3.105337 

0.4008973 

8.115399 

-15.25020 

0.7917457 

X-30 

-0.2494267 

3.695612 

0.4771014 

9.294399 

-19.48340 

1.949669 

X-81 

-1.569878 

6.641584 

0.8574248 

19.48299 

-21.46959 

3.454923 

X-R2 

3.300065 

9.936050 

1.282738 

53.11130 

-11.65420 

4.365307 

X-R2 

769987.4 

1140052. 

147180.1 

6769102. 

19431.00 

5.333261 
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APPENDIX  J 
EQUATION  USED  FOR  CLASSIFICATIONS  IN  REDUCED  SAMPLE 


LINEAR  "iND  QUADRATIC  EQUATIONS  IN  TEST  SPACE 
1.   LINEAR  EQUATION  U(FLD,  NFLD)  = 


VARIABLE  X-5 
-0.81061E-01 


VARIABLE  X-6 
-0.98117E-91 


VARIABLE  X-12 
-4. 3670 


VARIABLE  X-4  5 
2.4267 


VARIABLE  X-58 
-1.2611 


VARIABLE  X-64 
0.46988E-01 


VARIABLE  X-6 7 
16.119 


VARIABLE  X-73 
0.17130 


VARIABLE  X-75 
-0.37521E001 


CONSTAtJTl    =    13.49  4 
LOG   OF    P   =    0.27444 
2.       OU'VDRATIC    EOUATION    U  (FLD    ,NFLD)     = 


LINEAR    TrPMS 

VARIABLE    X-5 

-5 

2.5417 

0.35702E-01 

-6 

•-l.Oor.T 

-0.80917E-02 

-12 

16.676 

0.54493 

-45 

-8.1305 

0.33343E-01 

-58 

2.5921 

-0.33756 

-64 

0.92617 

0.61041E-01 

-67 

-1.66^n 

-0.66996 

-73 

-0.22215 

-0.58472E-01 

-75 

-1.70  57 

-0.n290E-01 

-76 

-  2  .  8  f,  f!  7 

-0.576481>01 

-81 

0. !(,]',■■'. 

0.18943E-01 

;-82 

1.715-5 

-0.19440E-01 

VARIABLE  X-76 
-0.18911 


VARIABLE  X-6 


0.891 

-0.323 

-0.298 

0.664 

-0.383 

1.797 

0.629 

-0.698 

0.265 

0.247 

0.115 


lE-02 

92 

61E-01 

99E-01 

60E-01 

1 

34E-03 

25E-02 

15E-01 

37E-02 

03E-01 


VARIABLE  X-81 
0.14428E-01 


VARIABLE  X-82 
0.69513E-01 


VARIABLE  X-12    VARIABLE  X-4  5 


-31.015 
24.345 

0.93238 
-3.9799 
61.295 

0.70418 
-0.23004 
-0.61608 

0.86949E-01 
-0.25778E-02 


-26.072 
1.5798 
-0.49030 
-39.642 
-0.16918 
-0.18066 
-0.47618 
0.  38222E-01 
0.14602 


VARIABLE  X-53   VARIABLE  X-64 


-0.79097E-01 

0.35533 
-9.2045 

0.43235E-01 

0.12723 

0.23345 
-0.12518 
-0.23954 


-1.2760 

20.566 

0.80749E-01 

-0.18837E-01 

-0.92430E-01 

0.12778E-01 

0.62304E-01 


(SECTIOM 

2) 

VARIABLr  X- 

-67 

VARIABLE  X-73 

VARIABLE  X-75 

VARIABLE  X-7 

X-67 

-927.77 

X-73 

-2.1'-61 

o.70070r;-oi 

X-75 

0.37371 

-0.12367 

0.24033 

X-76 

-2.00?n 

0.39OO8E-01 

0.10314 

0.28388 

X-81 

0.120^7 

-0.15201E-01 

-0.  33017E-01 

0.42868E-01 

X-82 

-1.1632 

-0.2.1107E-01 

-0.19630E-01 

-0.85487E-02 

VARIABLE  X-81    VARIABLE  X-82 


0.35014E-01 

0.10410E-01       -0.14050 

2  LOG  OF  P  =  0. 54887 

CONSTANT  1  =  -8.79  50 

LOG  OF  CONSTANT  2  =  -3.0834 

P  =  THE  RATIO  OF  APRTORI  PROnXBILITIES  OF  GROUP  J  AND  GROUP  I  AND/OR  COSTS  OF  GROUP  MEMBERSHIP 

FOR  OUADRATIC  KQUATKi-JS  CONSTANT  2  =  DETERMINANT  (DISPERSION  MATRIX  OF  GROUP  J) /DETERMINANT  (DISPERSION  MATRIX  OF  GROUP  I) 

OUADPATIC  EOUATIOrjr  ('(I, J)  ARE  READ  AS  A  HALF  MATRIX  OF  COEFFICIENTS  DEFINED  BY  THE  PRODUCT  OF  THE  ROW  AND  COLUMN  TERMS, 
PLUS  LINEAR  TERMS  STORED  IN  THE  FIRST  COLUMN,  PLUS  CONSTANT  1. 

QUAD^'ATIC  CL^SSIPICATrON  RULES  ARE  AS  FOLLOWS  — 

ASSir-l  TO  GROU!'  1  IF  FOR  ALL  OTHER  GROUPS  J,  THE  EQUATION  U(I,J)  IS  LESS  THAN  THE  NATURAL  LOG  OF  CONSTANT  2  MINUS  2 

TIMES  THE  NATUPJ\L  LOG  OF  P 

LINEAR  EQUATIONS  AFE  READ  AS  A  VECTOR  OF  COEFFICIENTS  PLUS  CONSTANT  1 

LINEAR  CLASSIFICATION  RULES  ARE  AS  FOLLOWS-- 

ASSIGN  TO  GROUP  1  IF  FOR  ALL  OTHER  GROUPS  J,  THE  EQUATION  U(I,J)  IS  GREATER  THAN  THE  NATURAL  LOG  OF  P 


apfendix  k 

classiftcatig::  of  the  reduced  sample  using  the 
standard  method 


Actual 


Predicted  Group 


?otal   By  Quadratic  Equation   By  Linear  Equation 


GroJp  Failed     Nonfailed     Failed   Nonfailed 


Failed         38        33           5  32         6 

(100)  (86.84)  (13.16)  (84.21)  (15.79) 

Nonfailed      50         5          4  5  2        48 

(100)  (]0.C0)  (90.00)  (4.00)  (95.00) 

Col.  Total     88        38          50  34        54 

(100)  (43.18)  (56.82)  (38.64)  (61.36) 


Note:   Figures  in  parentheses  are  percentages 
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APrENDIX  L        ■ 

CLASSIFICATIO:;  Oi'  TliE  Ri'DUCED  SAMPLE  UST:.'G  THE 
LACHENBRUCH  METHOD 


Predicted  Grou: 


Actual     Total   By  Quadratic  Equation   By  Linear  Equati^ 
GrouD 


Fail.ed     t\'onf  ailed     Failed   Nonf  ailed 


Failed 
Nonf ailed 
Ccl.  Total 


38       31 
(100)   (81.58: 


7  31  7 

(18.42)     (81.58)    (18.42) 


50        7           4  3  2         48 

(100)  (14.00)  (86.00)  (4.00)  (95.00) 

88       38           50  33         55 

(100)  (43.18)  (56.82)  (37.50)  (62.50) 


Note:   Figures  in  parentheses  are  percentages. 
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ap}t;!;dia  m 
CLASSiric;\TiON  or   Tiih:  originat,  sample  using  the 

"NORMALIZED"  MODEL 


Predicted  Group 


Actual     Total   By  Quadratic  Equation   By  Linear  Equation 
Group 


Failed     Nonfailed     Failed   Nonfailed 


Failed         30       28 

(100)   (93.33) 


2  30  0 

(6. 67)     (100.00)    (0.00) 


Nonfailed      30        0  30 

(100)    (0.00)     (100.00) 


1         29 

(3.33)   (96.67: 


Col.  Total     60       28  32  31         29 

(100)   (46.67)      (53.33)      (31.67)   (48.33) 


Note:   Figures  in  parentheses  are  percentages 
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APPENDIX    N 
CLASS IFICzVriON    Or'    THE    HOLDOUT    S/-u4PLE    USING    THE 

"nor:-;alized"  model 


Predicted  Groui 


Actual     Total   By  Quadratic  Equation   By  Linear  Equation 
<^^o^-P  Failed     Monfailed     Failed   Nonfailed 


Failed         30       24  6  25 


Nonfailed      30        6  24  7 


Col,  Total     60       30  3  0  3  2 


Note:   Figures  in  parentheses  are  percentages. 


2  93 


b 


(100)   (80.00)      (20.00)     (83.33)    (16.67) 


23 


(100)   (20.00)      (80.00)     (23.33)    (76.67) 


28 


(IOC)   (bO.OO)      (50.00)     (53.33)    (46.67) 
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